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Abstract 



This paper offers a precise analytical characterization of the distribution of returns for a 
portfolio constituted of assets whose returns are described by an arbitrary joint multivariate 
distribution. In this goal, we introduce a non-linear transformation that maps the returns onto 
gaussian variables whose covariance matrix provides a new measure of dependence between the 
non-normal returns, generalizing the covariance matrix into a non-linear fractional covariance 
matrix. This nonlinear covariance matrix is chiseled to the specific fat tail structure of the 
underlying marginal distributions, thus ensuring stability and good-conditionning. The portfolio 
distribution is obtained as the solution of a mapping to a so-called 0' field theory in particle 
physics, of which we offer an extensive treatment using Feynman diagrammatic techniques and 
large deviation theory, that we illustrate in details for multivariate Weibull distributions. The 
main result of our theory is that minimizing the portfolio variance (i.e. the relatively "small" 
risks) may often increase the large risks, as measured by higher normalized cumulants. Extensive 
empirical tests are presented on the foreign exchange market that validate satisfactorily the 
theory. For "fat tail" distributions, we show that an adequete prediction of the risks of a 
portfolio relies much more on the correct description of the tail structure rather than on their 
correlations. 
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1 Introduction 



Many problems in Finance, including risk management, optimal asset allocation and derivative 
pricing, require an understanding of the volatility and correlations of assets returns. In practice, 
volatility and correlations are often estimated from historical data and the risk dimension is rep- 
resented by the variance or volatility for a single asset and by the covariance matrix for a set of 
assets. In portfolio optimization, the variance of the distribution of returns is minimized by invert- 
ing the covariance matrix of returns. However, the covariance matrix is often ill-conditionned and 
unstable [Litterman and Winkelmann, 1998]. There are several origins to this unstable behavior, 
in particular the non-normality of the asset returns, i.e. large price variations are more probable 
than extrapolated from a normal estimation. For Levy or power law distributions with index less 
than two, the covariance matrix is not even defined. 

Option pricing and hedging relies on representing risk by a single scalar measure, the volatility. 
In practice, the volatility has the complexity of a time-dependent fluctuating surface defined as a 
function of the strike price and the time-to-maturity. Again, an important origin of this complexity 
stem from the fat tail structure of the underlying distributions as well as their non-stationarity. 

The need for models that go beyond the Gaussian paradigm is vividly felt by practionners, regu- 
latory agencies and is also advocated in the academic literature. Let us mention Geczy [1998] who 
proposes to use non-normal multivariate distributions to better assess factor-based asset pricing 
models : indeed, some factor models arc strongly rejected when relying on the presumption that 
returns or model residuals are independent and identically distributed multivariate normal, while 
they are no longer rejected when fatter tail elliptic multivariate distributions are used. 

The non-Gaussian nature of empirical return distributions has been first addressed by generaliz- 
ing the normal hypothesis to the stable Levy law hypothesis [Mandelbrot, 1963,1997; Fama, 1965]. 
Gaussian and Levy laws are stable distributions under convolution and enjoy simple additivity prop- 
erties that allowed the generalization of Markovitz's porfolio theory in a natural way [Samuelson, 
1967; Arad, 1975; Bawa et al., 1979]. More recently, a further generalization has been performed 
[Bouchaud et al., 1998] to situations where the marginal distributions of asset returns may have 
different power law behaviors for large positive and negative returns, with arbitrary exponents (pos- 
sibly larger than 2, i.e. not stable in the sense of Levy laws but only in a sense of large deviation 
theory) . 

This stable or quasi-stable property enjoyed by Levy and power laws, that are instrumental in the 
generalization of Markovitz's theory, presents however rather stringent restrictions. Indeed, these 
laws constitute the only solutions obeying the consistencey property [Kano, 1994], according to 
which any marginal distribution of random vectors whose distribution belongs to a specific family 
also belongs to the same family. This consistency property is important for the independence of 
the marginal variances on the order (number of assets constituting a porfolio, for instance) of the 
multivariate distributions (see the appendix A). Thus, the generalization of portfolio theory from 
Gaussian to Levy and then to power laws relies fundamentally on the consistency property [Kano, 
1994]. 

Elliptical distributions [Cambanis et al., 1981] provide a priori maybe the simplest and most nat- 
ural hope for describing fat tails and for generalizing portfolio theory to non-normal multivariate 
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distributions, solely on the basis of the measure of a covariance matrix. Elliptical distributions are 
defined as arbitrary density functions F of the quadratic form {X — {X)yYj^^{X — {X)), where X is 
the unit column matrix of the A'^ asset returns and E is a A'^ x A'^ dependence matrix between the A'^ 
assets which is proportional to the covariance matrix, when it exists. It was argued that the results 
of the CAPM extends to these elliptical distributions [Owen and Rabinovitch, 1983; Ingersoll, 1987] 
on the basis that, since any linear transformation of an elliptical random vector is also elliptical, the 
risk measure W'T,W (where W is the column vector of the asset weights) is positively proportional 
to the portfolio wealth variance. Thus the ranking of portfolios by risk adverse investors appears to 
retain their ordering. However, Kano [1994] calls attention to the fact that, in general, the density 
of a marginal distribution has not the same shape as the function F. This leads to an additional 
mispecification of the risk since it should be in principle captured both by the covariance matrix S 
and the shape and tail structure of the distribution. It is thus not possible to quantify the risk solely 
by T^'ST^. In practice, the marginal variance of one of the variable will depend on the number N 
of assets used in the portfolio. This is clearly an unwanted property for the chosen multivariate 
distribution. This inconsistency never arises for special cases such as the multivariate t-distribution 
or for mixtures of normal distributions. In appendix A, we clarify the origin of the problem and 
show that, for arbitrary elliptic distributions, the portfolio return distribution depends on the asset 
weights P, not solely via the quadratic form W'T,W : the weights W also control the shape as a 
whole of the portfolio return distribution [Sornette, 1998]. Minimizing the variance VF'EVF of the 
portfolio may actually distord the portfolio return distribution such as to increase the probability 
weight far in the tail, leading to increased large risks. This result provides a first cautionary note 
on any attempt to calibrate the portfolio risks by a single volatility measure. 

This paper aims precisely at ofi'ering a novel approach to address these questions. Our key idea 
is to use a representation that strives to remain as parsimonious as the Gaussian framework while 
capturing the non-Gaussian "fat tail" nature of marginal distributions and the existence of non- 
linear dependence. For this, a non-linear transformation maps the returns onto Gaussian variables 
whose covariance matrix provides a new measure of dependence between the non-normal returns, 
generalizing the covariance matrix into a non-linear fractional covariance matrix. In our approach, 
the linear matrix calculation of the standard portfolio theory is replaced by Feynman's diagram 
calculations and large deviation theory resulting from the fact that the portfolio wealth is a nonlin- 
ear weighted sum of the Gaussian variables. Notwithstanding this higher sophistication, all results 
can be derived analytically and thus can be fully controlled. In the present paper, we focus on the 
risk component of the problem, i.e. we assume symmetric return distributions. In other words, 
the average returns are sufficiently close to zero that the fluctuations dominates. Even if not true 
in reality, this approximation provides a precise representation of real data at sufficiently small 
time scales, corresponding to Sharpe ratios less than one. This situation is reasonably accurate for 
non-stock market assets, which do not have a long-term trend. To test and validate our theory, we 
thus use empirical data from the Foreign exchange. We leave for a future work the extension of our 
theory to the cases where the average return cannot be neglected. 

The paper is organized as follows. In section 2, we present our novel method for a full analytical 
characterization of the distribution of returns for a portfolio constituted of assets with an arbitrary 
multivariate distribution. In this goal, we introduce a new measure of dependence between non- 
normal variables, which generalizes the covariance matrix. The proposed nonlinear covariance 
matrix is chiseled to the specific fat tail structure of the underlying marginal distributions, thus 
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ensuring stability and good-conditionning. In our formulation, the multivariate distribution is 
normal in terms of a set of reduced variables that are the natural quantities to work with. In 
section 3, we show how the calculation of the distribution of returns for an arbitrary portfolio can 
be formulated in terms of a functional integral approach. In the appendices, we provide an extensive 
synthesis of the tools that allow us to determine analytically the full distribution of returns and 
work out in details the calculations for multivariate Weibull distributions. In section 4, we use the 
information captured by the distribution of returns of arbitrary portfolios to compare them with 
respect to their moderate versus large risks. We develop especially our analysis for uncorrelated 
assets with "fat tails" for which we compare different portfolios such as the ones that minimize 
the variance, the excess kurtosis or higher normalized cumulants. We also compare these portfolios 
with those determined from the maximization of the expected Utility. We show that minimizing the 
portfolio variance (i.e. the relatively "small" risks) may often increase the large risks, as measured 
by higher normalized cumulants. In section 5, we offer practical implementations and comparisons 
with data obtained from the Foreign exchange market. We validate clearly the performance of 
the proposed nonlinear representation. The comparisons between our theoretical calculations and 
empirical estimations of the cumulants of the portfolio return distributions are very satisfactory. We 
find in particular that, for a good representation of the risks of a portfolio, tracking the correlations 
between assets is much less important than precisely accounting for the "fat tail" structure of the 
distributions. Section 6 concludes with future extensions. 



2 Non-linear mapping to normal distributions 

The first step of our approach is to perform a non-linear change of variable, from the return Sx 
of an asset over the unit time scale r (taken as the daily scale for illustration below) onto the 
variable y{5x) such that the distribution of y is normal. For marginal distributions, this is always 
possible. We first illustrate this change of variables in simple cases and then proceed to the general 
multivariate situation. 



2.1 The case of a single security 
2.1.1 Non-centered case 

Let us consider the class of marginal distribution (probability density function or pdf ) p{5x) which 
can be parameterized as 

Pi5x)d5x = c4== d5x , (1) 

where C is a normalizing constant and /' denotes the derivative of the function f{Sx). Note that / 
must go to +00 for |(53;| ^ 00) at least logarithmically to ensure normalization. This distribution (|l|) 
describes so-called Von Mises variables [Embrechts et al., 1997]. This parameterization (||) covers 
all cases where the pdf has a single maximum. These are the relevant situations for essentially all 
securities. Note that for f{5x)/{5x)'^ for large \6x\, the pdf has "fat tails", i.e. the pdf decays 
slower than a Gaussian for large \5x\. 
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Let us then define the variable y by 



y = sign(&) ^\i{5^)\ . (2) 

The sign function is essential in order to keep possible correlations. In the case of fat tails for which 
yj\f{5x)\ « \6x\ for large \5x\, the variable y is the result of a "contraction" of 5x (such that its 
pdf is normal). Indeed, the pdf of y is such that the conservation p{y)dy = p{x)dx of probability 
holds and we obtain 

1 

P{y)dy = —f= e 2 dy . (3) 

Obviously, the form (0) has been chosen such that a change of variable recovers the normal pdf 
through a simple transformation. We stress that this parameterization (||) is fully general as any 
unimodal pdf can be put in this form. 



2.1.2 Example : the modified Weibull distribution 

Consider the case of the modified Weibull distribution where 

/(fe) = 2(^)', (4) 

and thus 

1 C CI / 1*^^! ^c 

p{5x)d5x = d5x — 1=— \5x\2~ e ~ for — cxd < < +oo . (5) 

The case, where the exponent c < 1, corresponds to a "stretched" exponential with a tail fatter 
than an exponential and thus much fatter than a Gaussian, but still thinner than a power law. 
Stretched exponential pdf's have been found to provide a parsimonious and accurate fit to the 
full range of currency price variations at daily intermediate time scales [Laherrere and Sornette, 
1998]. This stretched exponential model is also validated theoretically by the recent demonstration 
that the tail of pdf's of products of a finite number of random variables is generically a stretched 
exponential [Frisch and Sornette, 1997], in which the exponent c is proportional to the inverse of 
the number of generations (or products) in a multiplicative process. 

In this case (Q) , the change of variable (^) corresponds to 

y = sign((5x) |5x|2 . (6) 

For c < 2, y{5x) has a negative (resp. positive) curvature for 5x > (resp. for 5x < 0), i.e. 
is concave (resp. convex). This convexity reflects the contracting nature of the mapping that 

c 

allows to obtain a normal distribution, with a standard deviation = ■ This change of variable 
(|2|) has already been briefly mentionned in the literature. In a footnote of [Jorgensen, 1987], we 
find mentionned that, for distributions (P) with c > 2, the change of variable (P) gives a normal 
distribution. To our knowledge, the case c < 2 has not been investigated. 

The pdf (|5|) is called "modified" Weibull distribution, because it is slightly different by the distinct 
power in the pre-exponential factor from the standard Weibull pdf defined by 
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such that its cumulative distribution is exactly ex. This difference accounts for the fact 
that the change of variable (|6|) maps exactly the modified Weibull pdf onto a Gaussian pdf. This 
property does not hold exactly for the standard Weibull distribution. Notice that the modified 
Weibull distribution with c = 2 is nothing but the Gaussian distribution. This provides another 
incentive in this definition (^) as it retrieves exactly the Normal law as one of its member. From 
now on, we drop the term "modified" as we will only study (^). 

The Weibull distribution @ is not stable under convolution, i.e. the distribution of weekly returns 
is not exactly of the same form as the distribution of the daily returns. However, it presents an 
approximate stability in the sense that it is possible to define an effective exponent ct and scale 
XT such that the tail of the pdf of the distribution of return over T days is of the form (^) with c 
replaced by ct and X XT- The analytical procecedure to derive this result and numerical tests 
are given in the appendix B. 



2.1.3 Centered case 

We generalize (jl]) further by the following parameterization : 

p{5x)d6x = c4== e-^(-s-(^-) VWs^l--^)' d5x , (8) 

where m = E[(sign((5xa) \/|f(5x)|] and E[x] is the expectation of x. The parameterization (|8|) is such 
that, by the change of variable (|2p, expression (^) transforms into a standard centered Gaussian 
distribution 

dy _(.y-rnf 

p{y)dy = -1^ 6 2. (9) 



For the Weibull (stretched exponential) (||), this corresponds to 

1 C £_-! (Bign(iSx) |(5x|^— m)^ 

p{6x)d5x = d5x — 1=— \Sx\^~ e x'= for — cxo < fa < +oo , (10) 



where m = E[(sign(5x) |5x| 2]. The form ( [T0| ) of the Weibull exponential corresponds to the following 
cumulative distribution function 

P<(fa) = lerfc(!M±^). (11) 



2.2 General case of several assets 

Let us assume that the returns of assets over a period of T time intervals are measured : 
6x^\ (5x2^'*, 6x^^ for the first asset, 6x^i\6x^2^ , fa^^ for the second asset, up to 6x{^\ 6x2^\ 6xip 

^The complementary error function is defined as erfc(M) = dv exp(— u^), and has the asymptotic behavior 

erfc(it) — e~" for larse u > 0. 
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for the last asset. From these, the N marginal distributions of the returns for each asset are de- 
termined empirically either using a parametric or non-parametric representation. Call Pj{6x) the 
marginal distribution of the jth asset. In general, Pj{5x) is non-gaussian and exhibits fat tails, for 
instance described by a Weibull distribution. But we do not restrict ourselves to this special case 
and consider a general form for the Pj((5x)'s. 



2.2.1 Non-linear mapping 



As for the one security case discussed previously using (|2|), each marginal distribution Pj{6x) is 
mapped onto a gaussian distribution by performing a change of variable that is specific to each 
marginal distribution. We discuss here the symmetric case. For a given asset j, the new variable 
is such that 



Pj{Sx)d6x 



1 ^ 
e ay . 



(12) 



By construction, the variable y follows a Gaussian distribution of mean and variance 1. Intro- 
ducing the cumulative distribution Fj[x) 



Fj{5x) 



5x 



Pj{x')dx' , 



(13) 



the differential equation (12) for y{5x) is integrated into a general implicit equation giving y as a 
function of 6x : 



Fj{5x) 



For a Weibull distribution (|5|), expression (p^) reads 



erfc( 



(14) 



(15) 



We can rewrite (14) as 

y{6x) = \/2 erf-^ {2Fj{6x) - 1) (16) 

where erf~^ is the inverse of the error function. If Pj{6x) is of the form (|l|), the solution of ( |l6|) 
retrieves the expression (pi). 



We perform this change of variable (12,14,16) for each of the distributions Pj{5x). This leads to 
the transformed measurements y^\y2^^ , ...,y'>P for the first asset, y^\y^\ ...,y!^^ for the second 



asset, up to yi^\y2^\ 



yj^^ for the last asset. 



2.2.2 Nonlinear covariance matrix 

Having thus defined the variables y^^^ {j = l...A^) from the N assets, we construct the covariance 
matrix 

V = E[y y'] , (17) 
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with element Vab defined by 



Vab 



T ^ 



(18) 



i=l 



In (|T7|), we note for short y as the unicolumn matrix of size with elements y^^^ equal to some 
realization. 



For distributions that can be parameterized as we have seen that the change of variable (^6|) 
becomes expression (^). The covariance matrix (^) of the set of j/a's then corresponds to the 
following dependence matrix for the asset returns 6xa ■ 



Vab=E 



sign(&a)A/|f(5xa)| - ma sign(feb) J\f{6xb)\ - nib 



(19) 



where = E[sign(&a) y/\f{6xa 
covariance matrix Vat- 



The distribution of the ya is multivariate Gaussian with 



The covariance matrix (17,19) measures the covariance between the assets described by the "effec- 



tive" returns y. This definition (17,19) amounts to a non-standard measure of the covariance in 
terms of the variables Jx's. We use the name "nonlinear covariance" to recall that the change of 
variable 6x ^ y for fat tail pdf's Pj{6x) lead to a contraction, for instance a concave power law (|6|) 
for c < 2. In this case, the covariance of y amounts to estimate the covariance of fractional powers 
of the returns. 



2.2.3 Multivariate representation of the joint distribution of asset returns 

No approximation has been made until now. We have characterized the marginal distributions of 
the assets, defined new variables in terms of which the marginal distributions are Gaussian and we 
have calculated the covariance matrix of these new variables as the new measure of the dependence 
between the asset returns. 

Here comes the simplifying approximation. If the marginal distributions in terms of the new 
variables y^^^ are gaussian, nothing imposes a priori that the multivariate distribution of these 
variables is also a multivariate Gaussian distribution. From our construction, it is only guaranted 
that the projections of the multivariate distribution onto each y^^^ variable are Gaussian. 

However, a standard theorem from Information Theory [Rao, 1973] tells us that, conditioned on 
the sole knowledge of the covariance matrix, the best representation of a multivariate distribution 
is the Gaussian. In other words, the multivariate normal distribution contains the least possible a 
priori assumptions in addition to the covariance matrix (and the vector of the means when they 
are non-zero), i.e. is the most likely representation of the data. 

This implies that, conditionned on the sole knowlege of the covariance matrix ( [T7|Jl8| ), the best 
parametric, although not exact, representation of the multivariate distribution P(y) is 

P(y) = (2vr)-^/2 1^1-1/2 exp(-iy'y-iy) ^ (20) 

where \ V\ is the determinant of V. 
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Note that our "nonlinear covariance" approach is very different from the usual approximation which 
assumes that the asset returns Sx^-'^ themselves follow a multivariable Gaussian distribution. In 
particular, the non-linear change of variable ensures that the fat tail structure of the marginal dis- 
tribution is fully described and, in addition, leads to a novel more stable measure of the covariances. 
The normal structure of the multivariate distribution in terms of the y variables does not lead to 
a Gaussian multivariate distribution for the returns. 



From the expression (pOD, we obtain the explicit expression of the multivariate distribution of the 
asset returns by replacing the as a function of as given by (|l^) for each asset j and use 
the identity 

^(x) = P(y) ^ , (21) 

where ^ is the jacobian of the transformation from x ^ y. The Jacobian is the determinant of a 
diagonal matrix whose jth element is simply 

-|| = V?F,(fa«)e'»"''''' (22) 

as seen from (|l6|). This finally yields the following approximation for the multivariate distribution 
P(x) of the column vector x of returns of a given realization of the A'^ asset returns : 

N 

P(x) = |y|-V2exp (-ly' (y-i - /)y) n P^i^^'^) ' (23) 
where V is again the covariance matrix for y and / is the identity matrix. 

This "nonlinear covariance" approximation is exact for distributions with uncorrelated variables, 
in which case V = I. It is also exact for a Gaussian distribution modified by monotonic one- 
dimensional variable transformations for any number of variables; or equivalently, multiplication 
by a non-negative separable function. Note that the multivariate distribution (|2^) obeys automat- 
ically the condition that the corresponding marginal distributions Q are of the same analytic form. 
This corresponds to the consistency condition [Kano, 1994] discussed in the introduction. This 
approach leading to ( [2^ ) has also been introduced independently for the analysis of particle physics 
experiments by Karlen [1998]. 



2.2.4 Multivariate Weibull distributions 



Let us now apply this approach to Weibull distributions. The N assets are assumed to have their 
returns 6xa distributed according to the unconditional marginal distributions given by expression 
(^. With the change of variable (|2|), we construct the covariance matrix V defined by (|l^. This 

defines a signed and fractional dependence matrix for the asset returns. Since Vaa = E[[('5a;)^] 2], 

2 

we see that [V^a]~ has the same "dimension" as the usual variance but is fundamentally different 
in that 



E 



[(fe) 



in general . 



(24) 



i.e. the monovariate unconditional distributions derived from the muhivariate distribution by unconditionally 
integrating over all variables except one. 
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The expectation of a power is in general different from the power of the expectation. 

The multivariate Weibull exponential distribution of the asset returns is then given by expression 
(|2^ that we write explicitely 

P{6xi, 6x2, Sxn) Y\. '^^^i n ( 2 1*^^*! ^"^ ^^^'^ 



i=l i=l 

N N 



^^pf~^ ^ii Msign((5xi) |(5xi|2 - m;) (sign(&j) |&j|2 -mj)) . (25) 

^ i=ij=i ^ 

This expression obeys the requirement that the marginal (monovariate) distributions are Weibull 
exponentials. 



3 The distribution Ps{6S) of portfolio wealth variations 
3.1 Theoretical formulation 

We now restrict our study to symmetric multivariate distributions. As a consequence, all odd 
moments (in particular the first moment which is the expected return) and all odd cumulants are 
vanishing. This means that we are focusing on the aspect of risk embedded in the variance and in 
all higher even order cumulants. There is no trade-off between risk and return since the expected 
return is zero. We thus focus exclusively on the risk dimension of the portfolio. In another paper, 
we will present our results obtained for non-symmetric distributions. The results presented below 
are already sufficiently rich that we feel compelled to separate the discussion of symmetric and 
non-symmetric distributions so as not to make the presentation unecessarily cumbersome. 

To a very good approximation, it is harmless and much simpler to replace the returns 6xi{t) by 
{pi{t -|- 1) — Pi{t))/pi{t) where Pi{t) is the price of asset i at time t. Over reasonable large time 
intervals (e.g a year), one can neglect the variation of the denominator in comparison to the variation 
of the numerator 6pi{t) = pi{t + 1) — Pi{t). 

The total variation of the value of the portfolio made of N assets between time t — 1 and t reads 

N 

5Sit) = Y,Wi5p,{t) , (26) 
j=i 

where Wi is the weight of the ith asset in the portfolio. By normalization, we have X^i^i "^i — 1- 

In terms of the variables y^'s defined by (§) (resp. @ for the Weibull exponential case), the 
expression ( p6|) reads 

N 

5Sit) = J2w, signiy,) r\yf) , (27) 

i=l 

where f~^ is the inverse of / (suitably defined to avoid double valuedness) and respectively 

N 

55(i) = ^u;i sign(yi) Xi |yi|^ , (28) 
1=1 
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for the Weibull distribution for which fi{6pi) = l^l'^. 

All the properties of the portfolio are contained in the probability distribution Ps{6S{t)) of 6S{t). 
We would thus like to characterize it, knowing the multivariate distribution of the Spi^s (or equiva- 
lently the multivariate Gaussian distribution of the yj's) for the different assets. The general formal 
solution reads 



Psi5S) =cf[([ dy,) e-^ y'^'^ sfsSit) -j^^i^^miji) r\yl 
i=i V"' / V i=i 

Taking the Fourier transform Ps{k) = /+^ d6S Ps{6S) e'^^^^ of (|2|) gives 

Ps{k) = fl(J dy^ e-^ y'^-V+^fc El. sign(yO f-(y?) . 

i=l 



(29) 



(30) 



In the sequel, we present a systematic calculation method of (|30D, that can be applied in principle 
to a large variety of functional forms for /. In this paper, we restrict our analysis to the Weibull 
case, because it is probably one of the simplest non-trivial situation that allows us to make apparent 
the power of this approach. In addition, as will be shown by extended comparisons with empirical 
data, the Weibull representation provides a very reasonable description of the "fat tail" structures 
of empirical distributions. For the Weibull case, the term f~^{yf) is replaced by xl^il"- 

A further simplification of notation occurs when the exponent c is given by 

c = 2/q , with q integer and odd . (31) 

Then, 

sign{ui) \ui\~ = sign{ui) \uif = uf , (32) 
i.e. the sign function disappears. Expression ( p0| ) then becomes 

^^<*' = (2.)mLvv4 iJ ■ '''' 

We can absorb the terms Xi by introducing the new variable uf = xuh which shows that the 
covariance elements (in the formulation where the Xi do not appear) are proportional to x'^^'^- 



Expressions (27,2^) together with (^) exemplify the origin of the complexity introduced by the 
non-Gaussian nature of the distributions, namely the portfolio wealth is a nonlinear function of 
the variables y transformed from the asset returns and its distribution is expressed as a non- 
Gaussian multivariate integral, which is a priori not obvious to estimate. We notice however that 
(|3^ is similar to mathematical objects studied in another context: it is the same as the partition 
function of a (j)'^ field theory studied in particle physics, with N components and imaginary coupling 
coefficients ikwi. y is like a Gaussian field with interactions described by the second nonlinear term 
in the exponential. Notice that the case q = 3 corresponding to an exponent c = 2/3 is particularly 
interesting, because this value for c seems realistic empirically (see below the empirical section) and 
because the (p^ theory is the simplest non-trivial case leading to fat tails. 
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We shall characterize the portfolio distribution P{6S{t)) by its cumulants c„ defined by 

Psik) = exp(j2 . (34) 

The first cumulant c\ is the mean, the second cumulant C2 = — E[x])^] is the variance, the third 
cumulant C3 defines the skewness C3/C2 , the fourth cumulant C4 defines the excess kurtosis k = 
C4/C2. f\ For Gaussian distributions, all cumulants of order larger than two are zero. The cumulants 
are thus convenient ways to quantify the departure from normality. Furthermore, provided mild 
regularity conditions hold, they are equivalent to the knowledge of the full distribution. Since we 
deal here exclusively with symmetric distributions, all odd-order cumulants are vanishing and it is 
sufficient to calculate the even-order cumulants. 



3.2 The Gaussian case c = 2 

2 

When c = 2, the multivariate asset return distribution is Gaussian. The term sign(yi) |yi|~ becomes 
simply Ui and the integrals in (^) are standard Gaussian integrals that can be evaluated using the 
identity 

Y[{ J dxi)exp(^-^XiA-j^Xj + yiX^^ = \j e^viyi^iiVj) ■ (35) 
Applied to (|3^) and after taking the inverse Fourier transform, we get 

which recovers the classical result that the distribution of portfolio wealth variations 6S is uniquely 
and fully characterized by the sole value of its variance C2, expressed in terms of the covariance 
matrix of the asset price returns and their weight W in the portfolio [Markovitz, 1959, Merton, 
1990] : 

C2 = W'VW . (37) 



We now turn to the more general case c 7^ 2 for which the evaluation of the integral (pOj ) is much 
more involved. 



3.3 The case of uncorrelated assets 

We first consider the case where the covariance matrix is diagonal, which corresponds to an absence 
of correlations between assets. Then, the portfolio distribution is only sensitive to the intrinsic risks 
presented by each asset. We will show in the next subsection how to implement a perturbative 
diagrammatic expansion in the general correlated case. 

^The kurtosis is defined as the ratio of the fourth centered moment over the square of the variance E[{x — 
E[a;])*]/E[(x — E[a;])^]^ = 3 + 04/0^. In this definition, Gaussian distributions have a kurtosis equal to 3 while the 
excess kurtosis is zero. 
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For notational convenience, we adopt a change of variable which absorbs the x's in the exponential 
term of (|33|) : the term J2iLi Wi XiUt becomes Y^f=i Wi y^. As a consequence, the new yi variables 
acquire a non-unit variance equal to xf) with c = 2/q. 

In the diagonal case, the covariance matrix is given by 

V = diag{di} , where di = xj"^ ■ (38) 

The relationship between the variances of these y variables and the variances of the asset returns is 
obtained by noting that = which is proportional to the asset return variance. This illustrates 
that di plays the role of a "nonlinear variance" . 



3.3.1 Cumulants 



In the diagonal case, the multiple integral over the multiple asset contributions in (^) becomes 
the product of single integrals of the kind 



du e-^.^'^^^^"' 



(39) 



We consider the general case where we allow the exponent Cj = 2/gj to vary from asset to asset. 
From this integral, we derive in appendix C the exact and explicit expression of the general 2r-th 
cumulant of P{5S) : 



N 



C2r = J2Cir,q^)iwfdfr , 



(40) 



i=l 



where 



and r denotes the Gamma function. 



+ 



2!7ri/2 



(-1)^ 



2!7rV2 



, labeljfzjmg 
(41) 



The expression (|40D is valid even when the g^'s are real and the interaction term in ( |33D is propor- 
tional to sign(Mj)|nj|''' and thus applies to arbitrary Weibull distributions. 



For r = 1 (variance) and r = 2 (fourth order cumulant), the expression (??) gives 

C{l,q) = {2yV^)T{q + l/2) 
C{2,q) = (2^7^) r(2g+ 1/2) - (3 227vr)[r(g + 1/2)]^ . 



(42) 
(43) 



Note that the dependence of the cumulants in the diS as given in ( |40| ) enters through their qth. 
power d^ since, as we noticed before, Vah is proportional to the moment ([(5x)^]''). As already 
mentionned, \yabf (with (7 = |) has the same "dimension" as the usual variance. The dependence 
of the cumulants C2r{q) on the weights WiS enters only through the terms wfdl which are the same 
for all cumulant orders. 
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3.3.2 Tails of the portfolio distribution for c < 1 



We now characterized the extreme tails of the distribution of wealth variations of the total portfolio 
constituted of assets. We present results for the case where all assets have the same exponent. 
It is easy to reintroduce the dependence of Ci = 2/qi in the formulas, as it will become necessary 
when comparing to the empirical data in section 5. 

For this, we use in (|4|) to get 

.+00 (_i.2\m N s 

Ps{k) = e^p(Y. 7^C'(m,g)^K2d^rj . (44) 

m=l ^ ' i=l 

From the expression (??) of C{m,q) (see also Appendix C), we see that 

In C(m, q) -^m^+oo (| - 1) 2m In 2m + C'(2m) . (45) 
Two cases must be distinguished. 

1. q > 2: C{m,q) increases faster than exponentially for large cumulant orders 2m. As a 
consequence, the tail of the distribution Ps{SS) is controlled by the large cumulants. 

2. q <2: C{m, q) decreases with m and we cannot derive the structure of the tail from the large 
cumulants. 



For q > 2, i.e. c < 1 (stretched Weibull distributions), the behavior of large order cumulants 
embodies the structure of the tail of Ps{SS). From (pO|) , we see that 



C2miq) 



C{m,q) 



Max,- 



2 j<? 



(46) 



i.e. the high-order cumulants of the portfolio distribution are controlled by the single asset that 
maximizes the product wfdl of the square of the weight with the variance of the return. We call 
this maximum u^max^max- Keeping only this term (^) for the expression of the cumulants C2m{Q) 
of large order, we see that Ps{k) takes the form of the cumulant expansion for a single Weibull 
distribution with d1 replaced by ui^^x^max- We then deduce the expression for the extreme tail 



Ps{SS) ^|55|_oo exp 



r / \6S\ yi 




^u^maxUmax' ^ 


= exp 



\6S\ 



w 



maxXmax 



where Xn 



""max 



(47) 



For c < 1, we have thus shown that the extreme tail of the portfolio distribution is of the same 
functional form as the distribution of the individual assets with a characteristic decay rate it'maxXmax 
controlled by a single asset. The selection of this asset is a function of the parameters of the asset 
distributions and of the portfolio weights. Note that, for Wmax of order 1/A^, 



Ps{SS) ~ exp 



\6S\_y 



X 



(48) 



where x is a coefficient independent of A^. The extreme tail of Ps{6S) thus decays as the exponential 
of a power of A^ smaller than one. 



16 



This formulation of the tail of the distribution Ps{SS) of the portfolio wealth 5S allows one to 
characterize the risk by the single scale parameter x- This is in the same spirit as the portfolio 
theory for power law distributions [Bouchaud et al., 1998] in which all the different risk dimensions 
are encapsulated by the scale parameter of the power law distribution of the portfolio wealth 
variations. 

When the exponents Cj = 2/qi are distinct, the tail of the portfolio distribution is controlled by the 
asset with the smallest c, i.e. largest q. 



3.3.3 Tails of the portfolio distribution for c > 1 



For q < 2, i.e. c > 1, the above derivation does not hold. However, we can use the extreme 
deviation theorem [Prisch and Sornette, 1997] to determine the shape of the extreme tail of the 
portfolio wealth distribution. This theorem applies only for the exponent c > 1 and the stretched 
Weibull case previously analyzed is excluded from this analysis. This is due to the existence of a 
log-convexity condition f['{5xi) > 0, where /" is the second derivative of / defined in (||), as is 
shown in the Appendix D and in [Frisch and Sornette, 1997]. 

The Appendix D generalizes the extreme deviation theorem [Frisch and Sornette, 1997] to the case 
of non- identically distributed random variables. The result is 



Pn{SS) 



N-l 
TT 2 



* largo (55 



X UjLiWjXj 



,6S. 



2-c 



where 



X 



c{c - 1) X 



N-l 
2 



exp 



N 



i-y 



2(c-l)^ X 



c—^ — + c ■ 



(49) 



(50) 



This result is valid for c > 1. Note that the extreme tail of Pn{5S) decays as the exponential of the 
number of assets. This result gives the correct cross-over to the result (|48|) obtained for c < 1. 



This formulation of the tail of the distribution Ps{6S) of the portfolio wealth 5S again allows one to 
characterize the risk by the single scale parameter x, in a similar spirit as for power laws [Bouchaud 
et al., 1998], as already mentioned. The minimization of the risk in the tail is thus completely 
treated by finding the weights Wi that minimize x- Our theory below is more complete however, 
since the determination of all cumulants give us a characterization of the complete distribution and 
not only of its tail. 



3.4 The general case of correlated assets 
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3.4.1 Cumulants of the portfolio distribution with N assets 



We assume that all assets are characterized by Weibull distribution with the same exponent c = 2/q. 
Our goal is to compute the characteristic function 



1 



(27r)^/2 det 



N 



Yidui 



e 2 



lUV-^U+^kJ2,W^u''^ 



(51) 



For this, we introduce a perturbation analysis and the functional generator [Brezin et al., 1976; 
Sornette, 1998] defined by 



1 



(27r)^/2 det 

When the integral is a Gaussian {k = 0), we get 

P|(0,0) 

With the relationship 



we can formally express the characteristic function as 



ik , Wi 1 TT , T 



Ji=0 



This formulation is the most natural for a perturbative analysis of (^). 
In appendix E, we use the Feynman diagram method to calculate 



1 + ^77 2.^*77^+9^7 Z.^i^^A79 



5JJ <5Jf + ... 



j=o 



where we have defined the auxiliary coupling constant 

9q = q^-k ■ 



(52) 



(53) 



(54) 



(55) 



(56) 



(57) 



Up to second order in k (which will thus provide the variance of Ps{dS)), we get 



Plik) 



(<?-l)/2 



1 



1 1 



^ 1^ (g-20!(2!)2'+i(^! 



33) 



(58) 



The diagrammatic expansion used in appendix E becomes very useful for higher order terms in gq 
to obtain a systematic classification of their proliferation. A well-known result of diagrammatic 
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perturbation theory [Veltman, 1995] tells us that neglecting the disconnected diagrams, which 
appear beyond second order, corresponds to compute the logarithm of the characteristic function. 
Therefore, the set of connected diagrams at m-th order give us directly the m-th cumulant coefficient 
Cm defined by expression (|34|) . 



We give now the expressions of the second 02(8) and fourth cumulant 04(8) corresponding to q = 3, 
i.e. c = 2/8. The second cumulant is read from expression ([L59D in the appendix 



C2(3)=E 



QwiiVij) Wj + 9wi Vii Vi j Vjj w j 



(59) 



where Vij is the i,j element of the covariance matrix V between the variable yi of asset i and the 
variable yj of asset j. The sum is over all pairs of assets. The fourth order cumulant is 

C4(8) = 4!(8!)^ Y. '^ii'^i2'Wi3'^ii\^-^^L2^hi3^i2uy^^^^ 



T/.2. v- ■ y. ■ V- ■ V- ■ A ■ V - V- ■ ■ -I- 

23 «1«2 '^«2«3 '^«3*4 ^tiH ^ 24 *1*2 *1*3 ^ nU '^«3»4 ' 

2123 ^lj2^1«3^1M^2«2^3«3^4i4 ~^ ^ ^1 42 ^1«3 ^IM ^2*3 ^2*4 ^3*4 

The sum is carried over all possible quadruplets of assets. 

This result generalizes for higher order m-th cumulants and for arbitrary q as 

1 



(60) 



Cm{q) = m\{q\r E 



r<s=l 



(61) 



where the sum is carried over the set Qmio) of all the topologically inequivalent connected diagrams 
with m vertices of q legs as defined in the Appendix E and the symmetry factor S is of the form 
(see Appendix E) 



III IIL 

5({/J,K,}) = (2!)I^r.i''-n«r! n nrs\S,{{lr],{nrs]) . 



(62) 



r=l r<s=l 



3.4.2 Portfolio with two assets 



As an illustration, for the case of two assets (A^ = 2), Eqs.( p9| , pO| ) reduce to 

C2(8) = IbVl^wl + IWi^wl + {ViiV22)^ [6p?2 + 9/912] WiW2 , 

where the correlation coefficient is defined by 

Vl2 



The fourth cumulant is 



4! (8 



(a 



Pl2 



1 1 

24 ~ 4! 



^/VnV22 



)VxxV^2V22 + (^ + ^)Vi| 



(68) 
(64) 



iyl^w\w2 + V^2WlW2) 
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+ 2((a - ^)V-2 + ^^11^12 ^22)^/^?^^! , (65) 

where 

11111 

« = ^ + ^ + ^ + ^ + (66) 



If the two assets are not correlated, Vu = P12 = 0, expressions (|63|) and (|6^) recover the non- 
correlated expression (Eol) as they should, since 4!(3!)^ a = C(2,3) = 9720. 



For Vii = V22 = V, the expression ( p5| ) of the fourth cumulant simplifies into 

^4(3) r 4 4, , r / , \/i „2 M/„,,3„,, , „„ „.,3\ , o 2 r„ ^1 „2 M„.,2„,,2 



aK+u;^]+pi2[a-(-j + -)(l-/5f2)]Ku;2 + «^iw;^) + 2pf2[a-7T(l-Pi2)]«^I"'2 ■ (67) 



4!(3!)4y6 "'L-'i ' "-^j i /-iZL- ' 4!^^" > -i-:^; ' -rrzi- 4, 



4 Portfolio optimization for uncorrelated assets 



For non-Gaussian distributions, all the cumulants and not only the variance of the distribution of 
the portfolio wealth variations must be considered as relevant measures of risk. The variance as well 
as all higher order cumulants depend on the weights Wi of the assets constituting the portfolio with 
different functional forms. It is thus important to determine the relative variation of the cumulants 
when the weights Wi are modified. In particular, we show now that the portfolio which minimize 
the variance, i.e. the relatively "small" risks, often increases larger risks as measured by higher 
normalized cumulants. 



4.1 Minimization of the variance 



For simplicity of notation, we restrict to the case where all assets have the same exponent c = 2/g. 
It is straightforward to reintroduce the asset dependence of the q^s in the formulas. In order to 
take into account the normalization constraint Yli'^i = 1, we solve for the A^-th weight wn = 
1 — Y^fs^ Wi and we substitute it in the explicit formula for the variance 

C2(g) = (2g-1)!! Y.^1d\^[\-Y.wA d% 

1=1 \ i=l ) 

The variance being quadratic in the weigths Wi, the solution to the minimization problem is the 
solution to the linear algebraic system of — 1 equations in — 1 variables 

/ N-x \ 
Wid\ =\\-^Wi\d%,, i = l,...,N. 



i=l 



The solution can be written in a compact form by using the symmetric polynomials crl^'' {xi, 



,XN 



generated by ni^i(^ + ^ 



. . . + xn, ••• and cr^^ 



N 

n=0 ■ 



xi, . . . ,xi\f) such that cJq^^ = 1, a[" ' = xi + X2 + 



(TV) 



xiX2 - ■ ■ xn- The solution takes the form 
widl = W2d2 = . . . = w^d'j^ 



1; ■ 



a^^^ id" 



1 



(70) 
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The weights Wi that minimize the portfoho variance are inversely proportional to the corresponding 
asset variance df. 

Noting 

= -jq ' C^l) 
the cumulants corresponding to the weights that minimize the variance are 

cl{q)=C{r,q) • (72) 



4.2 Minimizing the higher-order cumulants 

The weights Wi that minimize the cumulants C2r{q) given by (EO) are 



widf-' = W2df' = ... = WNdf' = \ . (73) 

Note that these weights vary with the order r of the cumulant that is minimized. It is thus not 
possible to minimize simultaneously all the cumulants of order larger than two. This is in contrast 
to the result for the normalized cumulants discussed below. The weights that minimize the very 
large r +oo order cumulants approach asymptotically the values that minimize all the normalized 
cumulants of order larger than two, as given by (84) below. Thus, conclusions obtained for the 



normalized cumulants carry out for the cumulants in the limit r — > oo. 
The cumulants corresponding to (l73|) are 



where the Xj's are given by (JH^j. By definition, the two expressions (^) and ( [7^ ) coincide for r = 1, 
i.e. give the same minimum variance. 

It is interesting to compare the values taken by the cumulants for the weights that minimize the 
variance with the values taken by the cumulants for the weights that minimize a cumulant of given 
order 2r. For this, we study the ratio of the cumulants which is also the ratio of the normalize 
cumulants since the two expressions ( [7^ ) and (^) coincide for r = 1 : 

where 

X- ^ 
Xi = — — are normalized ^ Xj = 1 , (76) 

z2j j=i 
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and the defined by 

Differentiating C2^(9)/c™"'((7) given by ( |75| ) with respect to one of the Xj's, we find that the deriva- 
tive vanishes when all Xj's are equal to This corresponds to an extremum of all C2^(9)/c™"(9) 
equal to 1. This extremum is in fact a minimum. To see this, we parameterize 

X, = l(l + e,), (77) 

where the e^'s are small and sums up to zero {J2iLi = 0). We then expand ( [75|) in powers of e^'s 
and find 



i=l 



Thus, for unequal Xj's, the ratios C2^(g)/c2^*"'((/) are generically larger than one. In words, the 
values of the cumulants for the weights that minimize the variance are higher than those for the 
weights that minimize a cumulant of given order 2r > 2 : minimizing "small" risk increases large 
risks. 



4.3 Minimization of the excess kurtosis and higher normahzed cumulants 

Normalized cumulants provide a better measure of large risks than the (non-normalized) cumulants. 
The normalized cumulants are defined by 

= . (79) 

[02]"" 

The fourth normalized cumulant A4 is often called the excess kurtosis k. Recall that it is identically 
zero for Gaussian distributions. 

The quantitative deviation of a distribution from normality is given by normalized cumulants jf^pr • 
Indeed, the difference Vy{z) — g{z) between the cumulative distribution function of the sum of 
random variables and its Gaussian asymptotic value is given by [Gnedenko and Kolmogorov, 1954] 



where Qk{x) are polynomials parameterized by the normalized cumulants A„ of the distribution of 
the random variables. For instance, the two first polynomials are 

Qi{x) = \\^{l-x^), (81) 
o 

and 

Q2(x) = ^Aix^ + \{\x, - ^Ai)x3 + ( AaI - \\,)x. (82) 

A more straightforward way to recognize the role of the normalized cumulants is to work with the 
characteristic function defined in (|34[). Since k is the variable conjugate to 5x and the natural 
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scale for 6x is the standard deviation this means that the natural "dimensionless" variable is 
6x/y/c2, i.e. in the conjugate variable, it is k ^/c2 that we define as k. In terms of k, P become 

n^) = exp(-^ + E(-ir^P") . (83) 



Using ( |40| ) to construct the normalized cumulants X2m defined by (|79D, we find that the asset 
weights Wi that minimize X2m, irrespective of the order 2m, are given by 

{N),,q/2 ,g/2^ 



We recover (84) from the analysis of the tail of the portfolio distribution for c < 1 given in section 



( 3.3.2| ). The natural criterion is to find the weights Wi that minimize the characteristic decay rate 

given by (|3) : 

, for all w'js . (85) 



X of the tail of the portfolio distribution given by (| 



dwj 

After some calculation, we find that the weights that minimize x exactly those given by (p4[), 
that minimize the high order normalized cumulants. This confirms that the normalized cumulants 
are the relevant measures of the tail of the portfolio distribution. 

The expressions of the cumulants corresponding to the weig hts (g) are 

c^^^=NC{m,q)- . (86) 



J2i ^ 



If all assets have the same nonlinear variance di = d, then = c'^m orders 2m. Thus, 

minimizing the variance minimizes all normalized cumulants at the same time. This special result 
is not true in general as we now show. 



Using (76) and the definition (|7y) of the normalized cumulants, we get 

(87) 



AS 1 



m 

\ 



Using the Lagrange multiplier method to impose the normalization condition on the Xj's, we find 

that the values XiS that maximize the -jyj for m > 1 are all equal to for which -jyj = 1. For 

any other set of "nonlinear variances", the ratio is less than one. Thus, the weights that minimize 
the variance increase the higher normalized cumulants such as the excess kurtosis. 
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We now compare the cumulants of the portfoho with minimum variance to the cumulants of the 
portfoho with minimum normahzed excess kurtosis, for forming the ratio 

2m 

^2m _ \ ' 



With the definition (^J7l|), we get 



2m 



2m 

-2 



- (89) 



In particular, 



and 



iVj^Xf, (90) 



n\ (91) 



4 

-2 



We find that the ratio is a minimum equal to 1 when all Xj's are equal to For any other 

set of nonlinear covariance matrix, the variance of the portfolio which minimize the excess kurtosis 
is always larger or equal to the minimum variance. This is expected by construction. 

The ratio -Tyy is also found to be a minimum equal to 1 when Xj's being equal to XjN. This 

result generalizes to higher orders. For any other set of nonlinear covariance matrix, we find that 
the higher order cumulants of the portfolio which minimize the excess kurtosis are always larger or 
equal to the cumulants of the portfolio that minimizes the variance. 

4.4 Comparison between the excess kurtosis of the minimum-variance portfoho 
and the benchmark Wi = 1/N 

The benchmark with Wi = 1 /N has the following cumulants 

i 

We construct the ratio of the excess kurtosis of the portfolio with minimum variance to the excess 
kurtosis of the benchmark : 

A4 
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where the Xj's are defined by (71). 



For all Xj's equal, = 1 as it should. Changing all Xj's into their inverse change the ratio R into its 
inverse 1/R. This implies that the two situations are equally probable : either the excess kurtosis of 
the benchmark is smaller/larger than the excess kurtosis of the portfolio with minimum variance. 
Indeed, for each set of assets with Xj's for which R < 1, then the set of assets with inverse l/xj's 
gives R > 1. Finding the portfolio with minimum variance may thus either increase or decrease its 
excess kurtosis compared to that of the benchmark. 

Notice that for N = 2, Ris identically equal to unity for any possible choice of xi and X2- A portfolio 
constituted of two uncorrelated assets is thus such that the excess kurtosis of the benchmark and 
of the optimized variance portfolio are the same. This results is not true anymore for N > 2 for 
which R is usually different from one. 

It is also interesting to investigate the ratio of the cumulants 



(V) 
''2m 



-'2m nT2m ^'i 'i 



Expanding the Xj's around the equal nonlinear variance case Xi = -^(1 + e^) where the e^'s are 
small and sums up to zero J2iLi = 0) we get 

(V) N 

E^.'> (95) 



n/N) N ' 

^2m «=1 

up to second-order in e^'s. The higher-order cumulants of the portfolio with minimum variance are 
smaller than those of the benchmark. 



4.5 Synthesis 

We have obtained the following results. 

1. Minimizing the variance versus minimizing the normalized higher- order cumulants \2m '■ the 
portfolio with minimum variance may have smaller cumulants than the portfolio with mini- 
mum normalized cumulants but may have larger normalized cumulants (which characterize 
the large risks). Thus, minimizing small risks (the variance) may increase the large risks. 

2. Minimizing the variance versus minimizing the non-normalized higher-order cumulants C2m '■ 
the portfolio with minimum variance may have larger non-normalized and normalized cumu- 
lants of order larger than two than the portfolio with minimum non-normalized cumulants. 
Thus, minimizing small risks (the variance) may increase the large risks. 

3. Minimizing the variance versus the benchmark portfolio : the portfolio with minimum variance 
has smaller non-normalized higher-order cumulants than the benchmarck portfolio, even if it 
may have smaller or larger higher-order normalized higher-order cumulants. This situation 
is similar to the first case, i.e. it is preferable to minimize the variance than taking the 
benchmark, even if the resulting portfolio distribution is less Gaussian. 
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4.6 The expected utility approach 



The investor, starting the period with initial capital Wq > 0, is assumed to have preferences 
that are rational in the von-Neumann-Morgenstern [1944] sense with respect to the end-of-period 
distribution of wealth Wq + 5S. His preferences are therefore representable by a utility function 
uiWo + 5S) determined by the wealth variation 6S at the end-of-period. The expected utility 
theorem states that the investor's problem is to maximize E[m(Wo + SS)], where E[x] denotes the 
expectation operator : 

/ + 00 
d6S u{Wo + 5S) Ps{SS) . (96) 
-Wo 

u{W) has a positive first derivative (wealth is prefered) and a negative second derivative (risk 
aversion) . 

Consider first the case of a constant absolute measure of risk aversion —U"/U' = a, for which 
U{W) = — exp(—aW). In the limit of an initial wealth Wo that is sufficiently large such that 
the probability of ruin occuring in a one-period (i.e. 6S < —Wq) is negligible, expression ( p6|) 
transforms into 

y+OO s 

E[n(Wo + 6S)] = -e-'^^o P{k = ia) = -e""^" exp ( ^ a^" ) . (97) 

n=l ^ ' 



Due to the negative sign in front of the expression, maximizing E[it(Wo + i^S")] is equivalent to 
minimizing 'Y^=\ ^^fyr o^"", i.e. a weighted sum over the cumulants, each of them function of the 
asset weights Wi given by (^). For a small risk aversion a << 1, the sum in the exponential 
is essentially given by the first term 020? / 2. In this limit of small risk aversion, maximizing the 
expected utility retrieved the standard procedure of minimizing the portfolio variance. However, 
for larger risk aversions, the higher-order cumulants bring in non-negligible contributions. It is 
straightforward to solve for the optimization with explicit analytical formulas. 

Consider as another illustration the case where the utility is a member of the power (isoelastic) 
functions 

u{W) = with < 7 < 1 . (98) 

Then, assuming again that \5S\ < Wq for a single period (which is not at all restrictive an assump- 
tion in practice), we expand 

+00 I 

u{Wo + 6S) = {Wo + 5Sr = WSil + 6S/Wor = W^ Yl ( J'v (.^S/WoY . (99) 



+00 1 



Putting (H) in (|9|) gives 



mWo + SS)] = W^ E (^_2J)!(2.)! ni^S/Wo)'^] 



w^/i ^(^-^) ^2 7(l-7)(2-7)(3-7) C4 + 3c2 

^0 ^ ^ 2~ Wl 4! Wf~ " - 1 ^^""^ 
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where we have replaced the lower bound —Wq by —oo in the integral and E[((5S'/Wo)^] by C2/W0 
and E[{6S/Wo)^] by (04 + 3cl)/W^. Recall that the cumulants C2„ are given by ® with (??) 
and contain the dependence on the asset weights Wi. The parameters controlling the behavior of 
E[n(14/o + '^5')] are the exponent 7 and the N ratios df^'^/Wo of the asset price standard deviations 
normalized by the initial wealth. 

For large initial wealth Wq (compared to the one-period price standard deviations), only the first 
few cumulants need to be considered in the sum as the others are weighted by a negligible factor. In 
this limit, the best portfolio retrieves the optimal variance portfolio, since maximizing E[ii(VVo+'^'S')] 
corresponds to minimizing the first term C2/Wq in the expansion (100). 



In the other limit where the initial wealth Wq is not large compared to the one-period price standard 
deviations, E[ii(IVo + ^S)] receives non-negligible contributions from higher-order cumulants. The 
best portfolio is a weighted compromise between the different dimensions of risks provided by the 
different cumulants. 

An important insight obtained by this analysis is that the optimal portfolio depends on the initial 
wealth Wq, everything else being equal. This results from the existence of the many different 
dimensions of risks provided by the different cumulants which are each weighted by the appropriate 
power of the initial wealth. In other words, the initial wealth quantifies the relative importance of 
the high-order cumulants. 

Notice the parallel between the analysis of the two cases with constant absolute measure a of risk 
aversion and with a power (isoelastic) utility function : the relevant relative measure of the impact 
of the various cumulants (i.e. the importance of the tail of the distributions) is a for the first 
case and I/Wq in the second case. It is interesting to retrieve the standard variance minimization 
approach in the limit of small risk aversion or large initial wealth. In other words, when you use the 
standard variance minimization method, you implicitely express (perhaps unwillingly) a small risk 
aversion. This is logical since you drop all information on the large-order cumulants that control the 
large risks. In contrast, the full treatment incorporating the higher cumulants as relevant measures 
of risks allows us to respond to a much larger spectrum of trader's sensitivity and risk aversions. 



5 Empirical tests 

5.1 r ^ y transformation by Eg. (|T6| ) : examples and statistical tests 

5.1.1 Multivariate and marginal return distributions of six currencies 

In Figures |l],§,§J^,^ are shown the empirical bivariate distributions {N = 2) for pairs made of 
the Japanese Yen (JPY) and one of five other currencies, all quoted in US dollars. Each point 
{ri{t),r2{t)) is defined as the daily annualized return 

r.(t) = 250 In^i^^ (101) 

Si{t) 

where Si{t) is the price of currency i at day t. We use the index values i = 2 for the Japanese 
Yen and i = 1 for the other currencies which are respectively the Swiss Franc (CHF), the British 
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Pound (UKP), the Russian Ruble (RUR), the Canadian Dollar (CAD) and the Malaysian Ringgit 
(MYR). The time interval is from Jan. 1971 to Oct. 1998 except for the data with RUR where the 
time interval is from Jun. 1993 to Oct. 1998. 

The contour lines are obtained from Eq.(^) and corresponds to different probability levels. A given 
level gives the probability that an event (ri, falls inside the domain limited by the corresponding 
contour line. Three probability levels corresponding to 90%, 50% and 10% are shown for each pair 
except for the pair RUR-JPY where only the 50% and 10% levels are shown. In order to distinguish 
the contour lines from the data points, only one fourth of the data points (randomly chosen) are 
shown. 

In these figures, we also present the price time series Si{t) (top and right panels) and the corre- 
sponding monovariate distributions P{ri{t)). The scales of the axis for the bivariate and the abcissa 
of the monovariate distribution plots are chosen identical in order to highlight that monovariate 
distributions are obtained as projections of the multivariate distribution. The continuous lines 
correspond to the best fits of the Weibull pdf defined by (^) to the tail for large returns r of the 
empirical distributions shown as open circles. It can be compared to the best Gaussian fit shown 
as an inverted parabola in these semi-log plots. The values for the exponent c and the constant A 
are given in the figure captions. The fat tail nature of these pdf's is strongly apparent to the eye 
and is also reflected quantitatively by the fact that the exponents c's of the Weibull pdfs are all 
found of the order or less than one, while a Gaussian pdf corresponds to c = 2. These fits show 
that the distribution Eq.(|5|) provides an excellent representation for all six currencies. 

It is quite clear that the bivariate distributions are very non-Gaussian, as can be seen from the 
shape of the countour lines at the 90%, 50% and 10% confidence levels. For Gaussian distributions, 
the contour lines would be given by the equation r'V~^r = C, where C is a constant and V is the 
covariance matrix for the r variables. Depending on V, the contour lines would take the shape of 

• circles for Vn = V22 and V21 = V12 = 0; 

• ellipses with principal axis along the coordinates for Vn 7^ V22 and V21 = 0; 

• ellipses with principal axis tilted with respect to the coordinates for V21 7^ with the tilt and 
the eccentricity of the ellipses depending on Vn, V22J V21. 

• In particular, if Vn = V22 and V21 7^ 0, the ellipses are always tilted at ±45 degrees with 
the ratio of the small over large principal axis given by \/(l — |Vl2|)/(1 + |^i2|)- For perfect 
correlations IV12I = 1, the ellipse becomes a segment of straight line as expected for the 
dependence of two perfectly correlated variables. 

Consider first the 90% contour level for the CHF-JPY data Fig.^. It is apparent that the data is 
not described by an ellipse as prescribed by a Gaussian distribution, but rather the contour line 
takes the shape of a "bean". For the fairly large r events that fall outside the 90% level, events 
with different signs of ri,r2 are less likely to occur, or equivalently events which have same signs 
of ri,r2 are more likely to occur, compared to an ellipse described by the multivariate Gaussian 
distribution. At the 50% level, the same "bean" -like structure is found. Similar comments apply 
to the UKP-JPY pair. This "bean" structure is thus the signature of a stronger correlation of the 
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sign of the returns, i.e. in the trend of the US $ against these other currencies, for relatively small 
returns compared to the large returns for which this correlation weakens. 

For the RUR-JPY data, the limited statistics makes it hard to state anything decisive concerning the 
large r bivariate distribution, while the marginal distributions are clearly strongly non-Gaussian. 
However, the small r data seems not incompatible with Gaussian behavior, both for the marginal 
and bivariate distributions. This illustrated in figure @ that shows a close-up of the countour lines 
of the bivariate distributions. For small |r|(< 0.5%), the contour lines in all five cases appear either 
as approximate circles or ellipses with principal axis along the coordinates, thereby suggesting 
uncorrelated multivariate Gaussian behavior for small changes of exchange rate. 

The CAD-JPY contour lines shown in Fig.^ have a diamond like shape with principal axis along 
the coordinates. This signals that correlations are very weak and that, compared to an ellipse, the 
CAD-JPY data shows a higher probability for having events in which one of the returns is small. 
In other words, either one or the other exhibits a large move but rarely the two together. 

The MYR- JPY data looks similar except that the diamond principal axis appear tilted and "twisted" . 
The tilt reflects the fact that the MYR- JPY pairs are correlated, significantly more so than the 
CAD-JPY pair. 



5.1.2 Multivariate Gaussian distributions of the transformed variables yi 



In Figures [7|j8|,3, |lO| , |ll| , we plot the bivariate distributions -P(y) defined by Eg . (P0|) , obtained from 
Figures m2|,^,^j5| using the transformation Eg. (|l6[) . In each figure, we also construct the marginal 
distributions obtained by the transformation Eg.(p!^) performed on the empirical data points. The 
continuous lines represent the parabola (in this semi-log representation) corresponding to a Gaus- 
sian of unit variance. It is not surprising to find an excellent agreement between the empirical 
marginal distributions of the y variables and the Gaussian pdf, as the mapping (|l6| ) ensures this 
correspondence. We note however some discrepancy near y = 0, due to the large number of guotes 
with zero or very small absolute values of the returns and the limited accuracy of the data (the 
guotes are measured with a finite number of digits, in units of points). 

The contour lines for the bivariate distributions of the y variables are defined as in Figures 0,^,^,^,^ 
Let us mention that, due to the phenomenon just mentioned of the abnormal behavior of P{r) near 
r = 0, there is a high degeneracy of the points near y = 0, i.e. one single dot may correspond to 
many events near y = 0. 



Since the transformation Eg. (16) ensures that Vn ~ V22 ~ 1, it is easy to show that the contour lines 



should be ellipses tilted at an angle of ±45 degrees (for correlated/anticorrelated V12 respectively) 
with a ratio of the small over large pricipal axis of the ellipse given by v^(l — |Vi2|)(l + |^12|)- Visual 
inspection of the Figures |^J^,|lO|,^,|ll| confirms that the transformation (^) , which is exact for each 
of the marginal distribution, provides an excellent representation of the bivariate distributions. 
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5.1.3 Goodness of the nonlinear transformation and statistical tests 



The transformation Eq.(16) offers a new way for characterizing marginal distributions and their 



deviations from normality. 



In Fig. 12, we represent the transformation Eq.(16) for all six cm'rencies. The r-variables are 



calculated from the empirical data. From each data point, the transformation Eq.(|l6D provides 
the corresponding y value. The negative returns have been folded back to the positive quadrant 
(by taking their absolute values). The double logarithmic representation (log-log plot) is useful to 
characterize a power law (^, corresponding to marginal distributions in r given by the Weibull pdf 

Two straight lines are drawn on the figures. The y = r line corresponds to the exponent c = 2 
in Eq.(^) and Eq.(^), which would qualify a Gaussian pdf. The other straight line shown for 
each currency describes the transformation law (^), where the exponents c have been determined 
independently by fitting the tails of the marginal distributions reported in Figures |J2|,|,|J|. Thus, 
the Weibull distributions Eq.(^) correspond to exact power laws Eq.(^, translating into straight 



lines of slope c/2 in the log-log representation of Fig. 12. Without any adjustment of the exponent, 
we find an excellent consistency of the description in terms of the modified Weibull distributions 
for large returns. 



For smaller returns, we observe in Figure 12 that the y{r) curve bends down to become approxi- 
mately parallel to the line y = r suggesting again, in agreement what was found above from the 
contour lines of the bivariate distributions, that the distributions are approximately Gaussian for 
small absolute returns. 

We now present a statistical test for the reliability and "goodness of fit" of the representation 
Eq.([20|) of empirical multivariate distributions. We plot in thick line in Fig.l3a-e the cumula- 



tive distribution for N = 2 degrees of freedom versus the fraction of events as shown in Figures 
|7|j8|,p,lC,p!l within an ellipse of equation = y'^~^y- If the empirical distributions strictly fol- 
lowed the pdf Eq.(pOD, one should have a straight line. The thin lines correspond to the 90% 



confidence levels obtained from 100 Monte Carlo simulations : 100 random realizations with the 
same number of points as the empirical data, taken from a Gaussian multivariate distribution, are 
transformed into the representation Eq.(^). The thick line is found close to the diagonal, implying 
that the empirical -P(y) is very well approximated by a multivariate Gaussian. There is however 
a small but statistically significant departure from the diagonal, mostly in the upper parts of the 
figures, corresponding to the small returns. This seems to tell us that the correlation has a different 
structure for small and moderate returns compared to the larger returns, again in agreement with 
the Weibull (resp. Gaussian) representation of large (resp. small) returns. The largest deviation 
can be seen for the RUR-JPY data, which is to be expected since the statistics is the poorest for 
this set. 



In Fig. 13 f, we construct a similar plot for the multivariate distributions with the = 6 currencies 
taken all together. Since the empirical determination of the multivariate distribution requires us to 
keep only the quotes for all six currencies occurring simultaneously on the same day, this statistics 



for the multidimensional plot of Fig. 13 f is necessarily the poorest and is controlled by the poorest 



of all bivariate distributions which is the RUR-JPY set. Fig.l3f shows that the method works well 
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also with more than N = 2 assets. 



We have checked for the stationarity of these results by dividing the total time window into three 
sub- windows of equal lengths and have performed the same analysis using Eg. ([l^) . We find that the 



representation Eq.(pOD tested in Fig. 13 is extremely robust: no statistically significant differences 
are found between the three sub-windows. The stationarity of the nonlinear covariance matrix V 
is quite remarkable. 



The same cannot be said with a represention in terms of the return r variables. Fig. 14 illustrates 
this fact by constructing the same tests as shown in Fig.(|l^) but for the returns r instead of 
the transformed variables y. One can observe the dramatic departure from the diagonal, with 
an extremely large statistical significance, confirming the highly non-gaussian structure of the 
empirical multivariate distribution. This figure is presented to stress (1) how far from Gaussian 
are the empirical distributions and (2) how good is our transformation Eq.(^) in comparison. 
The transformed variable y is indeed the correct one to work with to get a robust and stationary 
representation. 

Notice that, in Fig.^ the thick line does not approach the diagonal near abcissa values close to 
1, as one could naively expect from our argument that the empirical distribution are not far from 
Gaussian for small returns. The reason is that the distribution may be close to Gaussian for small 
returns, but with a different covariance matrix. This interpretation is born out by the observed 
deviations from the diagonal in Figjl^ occurring for small y's. 

The fact that the thick line in Fig.^ is below the diagonal for most of the range of can be 
explained as follows. The correlation matrix of the returns is estimated over all returns, including 
the large realizations. Therefore, for a given value of x^, the corresponding ellipse is much larger 
than the ellipse that would be estimated from the covariance matrix (let us call it R) of the small 
returns. For small values of r (close to 1 on the abcissa of Fig.|l^, there are therefore more points 
inside, i.e. fewer points outside, than one should expect for an ellipse evaluated from the small 
returns. Another way to see this result is to recall that y k, r for small r as shown in Fig.|l^. Since 
R^^ is much smaller than (think of a matrix with only diagonal elements), is therefore 
much smaller for small r-values than for small y-values. This implies that, for small r, the integral 
on the abcissa stays approximately constant as one decreases the fraction of events outside the 
ellipse on the ordinate. In comparison in Fig. 14, since is much larger than R~^, is larger 



and the integral on the abscissa decreases with the same rate as the fraction of events outside the 
ellipse on the ordinate. 



5.2 Variance and excess kurtosis 

Consider one of the five portfolios with two currencies investing a fixed fraction p of its wealth W 
calculated in US$ in currency 1 and the remaining fraction 1 — p in currency 2, where 1 and 2 refers 
to the five pairs of currencies analyzed in the previous figures. Using the historical time series, we 
construct numerically the time series for the portfolio value W{t) from the recursion 

W{t + 1) = pW{t)si{t) + (1 - p)W{t)s2{t) (102) 
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with p fixed. This expression ensures that the fraction of the wealth invested in a given currency is 
constant. We do not address here the issue of friction and transaction costs that would be involved 
in this dynamical reallocation but rather focus on the tests of the theory. 



The annualized daily return rw of W{t) is defined by rw{t) = 250 ln^^^^|^^^. Fig. 15 shows the 
dependence as a function of p of the variance 

C2 = {{rw - {rw)f) , (103) 

of the excess kurtosis 



C4 ^ {{rw - {rw)) 
4 {{rw - {rw)) 
and of the sixth-order normalized cumulant 



"-^ _ \i' w - V w n I o /in/i^ 



Ae = ^ , (105) 

of the daily portfolio returns. The excess kurtosis and the sixth-order normalized cumulant quantify 
the deviation from a Gaussian distribution and provide measures for the degree of "fatness" of the 
tails, i.e. a measure of the "large" risks. Taking into account only the variance and the excess 
kurtosis and neglecting all higher order cumulants, a distribution can be approximated by the 
following expression valid for small excess kurtosis [Sornette, 1998] 



P{rw) — exp 



{rw - {rw))'^ f ^ ^K{rw-{rw))" 



2C2 V 12 C2 



(106) 



The negative sign of the correction proportional to k means that large deviations are more probable 
than extrapolated from the Gaussian approximation. For a typical fluctuation \rw — {rw)\ ~ \/c2, 
the relative size of the correction in the exponential is j^. For large values of k, this approximation 
dlOq) break down and the deviation from a Gaussian is much more dramatic. 



The most interesting feature of Fig. 15 is that the weight that minimizes the variance does not 
correspond to the minimum of k or of Xq. This illustrates one of the main message of our work, 
derived in section 4 analytically for non-Gaussian distributions : minimizing the portfolio variance is 
not optimal with respect to the large risks. The strength of this effect depends on the relative shape 
of the distributions of the assets constituting the portfolio and their correlations. The strongest 



effect in Fig. 15 is found for the Malaysian- Yen pair, for which p ~ 0.5 minimizes the variance 



but gives an almost four-fold increase of the excess kurtosis compared to its minimum. Fig. |15 
shows that it is possible to do much better and construct a portfolio which has not much more 
"small" risks (as measured by the variance) while having significantly smaller "large" risks (as 
measured by the excess kurtosis and six-order normalized cumulant). This is due to the fact that 
the excess kurtosis and the six-order normalized cumulant have in general a direct or inverted S- 
shape with a rather steep dependence or narrow well as a function of the asset weight p, while the 
variance exhibits comparatively a smoother and rather slower dependence. The investor will be 
well-inspired in using the additional information provided by the higher-order cumulants to control 
for the portfolio's risks. 

It is also very interesting to observe that k and Ag exhibit similar behaviors with minima occurring 
essentially for the same weight. This confirms a prediction of our theory, according to which the 
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asset weights that minimize X2rm irrespective of the order 2m for m > 1, are given by Eq.(p4D. 
Notice that this prediction holds in principle only for uncorrelated assets, while the correlation 
coefficients are p{yi,y2) = 0.57 (CHF-JPY); 0.43 (UKP-JPY); -0.06 (RUR-JPY); 0.07 (CAN- 
JPY); 0.32 (MYR-JPY), where p is defined by (|m|). The case where the correlations are weak 
(RUR-JPY and CAN-JPY) exhibit the best agreement with our prediction. The agreement is still 
reasonable for the three other cases with larger correlations. The main lesson learned here is that 
such correlations are not very important for "fat tail" distributions (c < 1.5) in the determination 
of cumulants of large orders (in practice larger than two) . 



Figjlqa-b and d-e compare the empirical excess kurtosis (fat solid line) shown in Fig. 15 for the five 



portfolios to our theoretical prediction (40) with (??) (solid line). We use the result for uncorrelated 
assets as the coefficient of correlations are small for two of the five portfolios. In addition, we find 
that the empirical exponents c are much less than 2 for which the calibration of the exponents 
c plays a much more important role in the determination of the high-order cumulants than the 
correlation. We leave for a future paper the extension of our theory to the case of correlated assets 
with different exponents c. The exponents ci and C2 are those determined in the fits of the pdf 's tail, 



as given in Fig. 12. There is thus no adjustable parameters in the comparison shown in Fig.l6a-b 
and d-e. The thin solid lines and dashed lines plot the theoretical formula ( ^0| ) for values of the 
exponents Oi = Cj/2 it 0.05, so as to provide uncertainty brackets for the comparison. 

FigJT^c (RUR) shows the empirical excess kurtosis (fat solid line) and the theoretical prediction 
( |40D with the fixed exponents ci and C2 given in Fig.|l2|. The thin solid line gives the predicted 
excess kurtosis for exponents ci + 0.05, C2 ± 0.05. A better agreement is observed for an exponent 
ci + 0.05 for the Russian currency slightly larger than determined from fitting the tail of its pdf. 
We interpret this result by the fact that this fat tail pdf embodies the effect of much higher-order 
cumulants, while the excess kurtosis is relatively a still rather low-order cumulant. 

Figjl^ compares the empirical excess kurtosis (fat solid line) of the portfolio CHF-JPY (FigJI^a) 
to the prediction (|65|) with (|6^) for correlated assets with the fixed exponents ci = C2 = 2/3. The 
thin solid line correspond to the empirical value p{yl,y2) = 0.57 while the dashed line is obtained 
from the same formula with p{yl,y2) = 0. The lesson we learn here by comparing the dashed and 
thin solid line together with the solid line of Fig.^^a is that the existence or absence of a correlation 
for "fat tail" distributions is not very important for the determination of the excess kurtosis. Much 
more important is the correct determination of the tail exponents c of the Weibull distributions. 

In summary, we find a good agreement between the empirical excess kurtosis and our prediction 
with (]40| ) with fixed exponents ci and C2 determined in Fig.^. The other theoretical curves provide 
the range of uncertainty in the determination of the excess kurtosis coming from measurement errors 
in the exponents c. In fact, the fits with the predicted excess kurtosis look amazingly good, because 
even minor effects like the double well structure in Fig. 16c near p = and in Fig.^^d near p = 1 



are in accordance with theory. These results are very consistent : a bad choice of c leads to a bad 
fit of the pdf tails of each asset constituting the portfolio and change completely the kurtosis of the 
portfolio pdf, as can be seen from its large sensitivity under relatively small variations of ±0.1 of 
the exponent c. 

The main point here is that the theory adequately identifies the set of portfolios which have small 
excess kurtosis and thus small 'large risks' and still reasonable variance ('small risk'). We stress 
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the importance of such precise analytical quantification to increase the robustess of risk estima- 
tors : historical data becomes notoriously unreliable for medium and large risks for lack of suitable 
statistics. 

6 Conclusion 

We have presented a novel and general methodology to deal with multivariate distributions with 
non-Gaussian fat tails and non-linear correlations. In a nutshell, our approach consists in project- 
ing the marginal distributions onto Gaussian distributions, through highly nonlinear changes of 
variables. In turn, the covariance matrix of these nonlinear variables allows us to define a novel 
measure of dependence between assets, coined the "nonlinear covariance matrix" , which is specifi- 
cally adapted to remain stable in the presence of non-gaussian structures. We have then presented 
the formulation of the corresponding portfolio theory which requires to perform non-Gaussian in- 
tegrals in order to obtain the full distribution of portfolio returns. We have developed a systematic 
perturbution theory using the technology borrowed from particle physics of Fcynmann diagrams to 
calculate the cumulants of the portfolio distributions, in the case where the marginal distributions 
are of the Weibull class. The main prediction is that minimizing the portfolio variance may in 
general increase the large risks quantified by the higher-order cumulants. Our detailed empirical 
tests on a panel of six currencies confirm the relevance of the Weibull description and allows us 
to make precise comparisons with our theoretical predictions. For "fat tail"' distributions, we find 
in particular that the valid determination of large risks, as quantified by the excess kurtosis, are 
much more sensitive to the correct measurement of the Weibull exponent of each asset than to their 
correlation, which appears almost negligible. 

Plenty of works remain to be done to explore further this approach. 

• The case of assets with different exponents c have been treated only for uncorrelated assets 
and the corresponding problem of heterogeneous c's in the correlated case is relevant for a 
precise comparison with empirical data. Furthermore, we have not studied assets with large 
exponents c > 1.5. The relevance of correlations increases with increasing c and we expect a 
precise determination of the correlation matrix to become more important as c ^ 2. 

• We have focused our analysis on the risk dimension of the problems by studying symmetric 
distributions, i.e. assets which are not expected to exhibit long-term trends. A natural and 
relevant extension of our theory is to treat the case where the mean return is non-zero and 
different from asset to asset. 

• The next level of complexity is to have non-symmetric distributions, with variable Weibull 
exponents c and with correlations. 

• The perturbation theory in terms of Feynmann diagrams can be used for other classes of 
distributions and it would be interesting to explore in details other potentially useful classes. 

• We have assumed and found to be reasonably verified that the nonlinear covariance matrices 
are stationary. There is however no conceptual difficulty in generalizing and adapting the 
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ARCH [Engle, 1982] and GARCH [Bollerslev et al., 1992] models of time- varying covariance 
to this formulation in terms of effective y variables. 

• Our empirical tests have been performed on small portfolios with two and six assets, with the 
purpose of a pedagogical exposition and easier tests of our theory. It is worthwhile to extend 
our work to larger and more heterogeneous portfolios. 

These goals all seem within reach and we intend to address these questions in future works. 
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7 APPENDIX A: Consistency condition for Elliptic distributions 



Elliptic multivariate distributions P{X) are defined by 

P{X) =f{{X- X^fV-^{X - X^)^ . (107) 

X^ is the unit column vector of the average returns and F is a dependence matrix proportional to 
the covariance matrix when it exists. The function F is kept a priori arbitrary (non- negative and 
normalized). If F is an exponential, ( |107| ) retrieves the normal distribution and V becomes the 
covariance matrix. 

The proposition that the results of the CAPM extends to elliptical distributions [Owen and Ra- 
binovitch, 1983; Ingersoll, 1987] relies on the "consistency" condition, according to which any 
unconditional marginal distribution is of the form F{y^ [Xi — X?)^) with the same function F. In 
particular, this leads to the fact that the distribution of the portfolio is a function solely of W'VW 
(where W is the column vector of the asset weights) with a functional form independent of the 
weights W and the number of assets in the portfolio. 

Kano [1994] has shown that this is not true for most marginal distributions. For instance, for 
F{x) = Cexp[— a/|x[] where C is a normalizing constant, the unconditional variance of a single 
variable calculated using ( |107| ) is equal to {N + l)/2 times the variance obtained for a single variable 
using the same functional form. The result is thus not independent of N . This inconsistency implies 
that the overall shape of the distribution will change as a function of the asset weights in the 
portfolio. As a consequence, W'T,W is not the sole measure of the portfolio risks. The conclusions 
of Owen and Rabinovitch [1983] and Ingersoll [1987] thus apply only to a restricted class of elliptic 
distributions for which the consistency condition apply. 

Let us show now explicity that 

1. the dependence of the portfolio distribution on its wealth variation 6S can be expressed solely 
in terms of the ratio (SS)'^ /W'VW , but 

2. the distribution itself has a functional form which, in general, is still dependent on the asset 
weights W constituting the portfolio. 

For this, we write its variation during a unit time step as 

N 

5S{t) = Wadx^it) = W'X . (108) 

a=l 

The density distribution P{6S) can be written as 

p(SS) = J dXF{{X - A°)^F-^(A - X^)^5{5S - W'X) , (109) 

where 5{x) is the Dirac distribution. To estimate this integral, we isolate one of the assets xi = 
x\ + yi and write, using Y = X — X^ , 

Y^V-^Y = V{-^^yl + 2{v^y)yi + y^V'^y , (110) 
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where V^j^ is the element ij of the matrix V ^, y is the unit column vector {y2,y3, ■■■■jUn)' of 
dimension — 1, is the unit column vector {V^^ , V^^, V^^)'^ of dimension — 1, and is 
the square matrix of dimension — 1 by A'^ — 1 derived from by removing the first row and 
first column. The factor 2 in 2{v^y)yi comes from the symmetric structure of the matrix V~^. 



We can now express the condition 5{5S — W'X) in the integral (109) : 



—5{yi-—{5S-W'Xo-W'y)), (111) 

Wl Wl 

where W is the unit column vector {w2,W3, ...,W]\fy of dimension A'^ — 1. The integration over the 
variable yi cancels out the Dirac function and we obtain the argument of the function F under a 
quadratic form in the variables S and y. Using the identity 

X'V'^X + X'Y = X'V'^X -^Y'VY , (112) 
where X = X + VY, we obtain 

Pi6S) = JdyF(y'M-^y + ^^), (113) 
where the integral is carried out over the space of vectors y of dimension A^ — 1 and 

= - —(v - ^W)W' . (114) 
Wl 2wi 



We can finally write 



/ xc2 \ 



where J-'ix) is defined by (113). 



This confirms that the typical volatility of the portfolio is controlled by the quasi-variance W'VW 
as for the normal case. It is then natural to optimize the portfolio using this measure of the 
risk. However, it is clear that, for arbitrary functional forms of F, the specific functional form 
of the distribution P{6S) is in general a function of the asset weights constituting the portfolio. 
Minimizing only p^Vp may thus be insufficient because it may be linked to a dangerous deformation 
of J-{x) in the tail. 
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8 APPENDIX B: Derivation of the approximate stability in fam- 
ily of the Weibull distributions 



Consider an asset with daily returns distributed according to (|5|) with c < 1. What is the distribu- 
tion of returns over T days? It is convenient to use the representation of the distribution in terms 
of its cumulants defined by the derivatives of the logarithm of its characteristic function : 

(116) 

k=0 

The characteristic function thus reads 

oo 

Pik) = eMT.'^{ikr}. (117) 



i-iy 



log P{k) 



Assuming the absence of correlations between successive daily returns, the distribution of returns 
over T days is the distribution of the sum of T independent random variables. Its cumulants are 
thus T times the cumulants of the distribution of the random variables 0. Adapting the results 
derived in Appendix C to the case of a single asset with "nonlinear" variance d = according to 
(|38| ) with c = 2/(7, we obtain the expression of the cumulants of the returns over T days as 

C2r{T) = Tc2r{l) = T C{r,q) (fi , (118) 

where C{r,q) is a numerical factor given by (??) in the main text and in Appendix C. This 
expression ( |118| ) is valid for any real value q, i.e. describe the case of a Weibull exponential with 
arbitrary real exponent c = 2/q. 

The deviation from the normal distribution is quantified by the normalized cumulants 

_ C2r{T) _ C{r,q) 1 
^'^'^ ' ~ [c2{T)]r - [C{l,q)Y T^'-i ■ ^''^^ 

The cumulants of order 2r larger than 2 decay to zero as T increases to infinity. This constitutes 
a signature of the central limit theory according to which the distribution of returns for the sum 
of T — > oo i.i.d. random variables tends to the normal law. In practice, the distribution of returns 
over a finite T is not exactly normal because the convergence to the normal law is rather slow. 
For instance, the excess kurtosis decays to zero only as l/T, starting from a rather large value 
k{T = 1) = 7.2. 

For finite T > 1, the distribution of returns over the time scale T may be approximated by a 
Weibull pdf with an apparent exponent ct = '2/qT larger than the exponent c determined at the 
daily time scale. For this, we approximate the first cumulants by their expression for a Weibull 
distribution with adjustable nonlinear variance dj- and different exponent ct = '2/qT- This amounts 
to look for an approximate representation of T C{r,q) d'^^ by C{r,qT) djf^. Since there are only 

■'This stems from the fact that the distribution of the sum is a convolution integral a nd i ts Fourier transform is 
then the product of the Fourier transforms of each individual pdf's. From the definition (IIFI), the result follows. 
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two variables and qx to optimize, they can be determined from two conditions, that we take to 
be the correspondence of the variance and excess kurtosis 



T C{l,q) <P = C{l,qT) , (120) 

T C7(2, q) (f'i = C7(2, qr) . (121) 
Eliminating dj- between the two equations gives qx as the solution of 

^ C{l,qT) C{2,q) 



C{l,q) C(2,qT) 



(122) 



For a distribution of daily returns given by a Weibull distribution with exponent ci = 2/3 (g = 3), 
we find that the monthly (T ~ 25) returns are distributed according to an exponential (C25 ~ 1). 
This approximation illustrates the very slow convergence to the normal law since the value C25 ~ 1 
is still very far from the asymptotic normal value Cqo = 2. 

The effective variables (It and qx can be determined by more global conditions such that the 
weighted sum of the square of the differences T C{r,q) dF"^ — C{r,qT) djf^ be minimum over a 
certain set of r's, this set controlling how far in the tail the approximation is valid. 

We test this idea by the following synthetic tests. Let us call ci = 2/qi = 2/3 the exponent of the 
Weibull pdf Pi of the returns ri at the daily time scale. We construct the pdf Pt of the returns 
r-r over T days by taking the characteristic function of Pi to the T-th power and then taking 
the inverse Fourier transform |^. Let us now test whether Pt can be approximated by a Weibull 
distribution with an effective exponent ct = 2/qT and determine its value as a function of T. 

For this, we perform the change of variable r-p VT{fT) given by (|2|) with (^, with a given choice 



for the exponent ct and using ( |12C , 122 ) to get xt = dUp. If the T-fold convolution distribution Pt 
of the Weibull distribution Pi is approximately a Weibull, this change of variable should lead to an 
approximate gaussian with unit variance for the correct choice of ct- We check the consistency of 
this program for T = 1 for which we do retrieve, as expected, an exact Gaussian with unit variance 
independent of c and x- 



Figs.l7,l^jT^,|2^ plots the pdf's PT{y) as a function of z = so that a Gaussian (in the y variable) 
is qualified as a straight line (dashed line on the plots). Thus, from the series of transformations, 
a straight line qualifies a Weibull distribution. We show the cases T = 2, 4, 8 and 20 for which 
the best ct are respectively C2 = 0.73, C4 = 0.80, cs = 0.90 and C20 ~ 1.05. The other curves 
allow one to estimate the sensitivity of the representation of Pt in terms of a Weibull as a function 
of the choice of the exponent ct- These simulations confirm convincingly our proposal that a 
Weibull distribution remains quasi-stable for many orders of convolutions, once the exponent ct is 
correspondingly ajusted. We observe on Fig. 17, 18,1^,?? that the Weibull representation is accurate 
over more than five orders of magnitude of the pdf Pt- Only for the largest time scale T = 20, we 
observe significant departure from the Weibull representation. 



^We use the theorem that the Fourier transform of the convolution of two functions is the product of the Fourier 
transforms. 
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9 APPENDIX C: Calculation of the cumulants of P{6S) in the 
diagonal case 



The integral 

can be perturbatively expanded as 

li 



/+00 
-oo 



du e 



:+ikwiu'' 



- (2m!) 



r 

Jo 



due 



(123) 



(124) 



/ 

^0 



dt e-Hi"'-^ 



= (2™!) 

m=0 ^ 

We recall the definition T{x) = /q°° dte^^t^^"^ and the property T{n + 1/2) = tt^/^^^^^^^ when 
is an integer and (2n - 1)!! = (2n - l)(2n - 3)(2n - 5). ..5.3.1. 

The density Pg the refore becomes 



n 



5:(^fdDK'<^i; 

i<j 



+ ■ 



(125) 



that can be exponentiated to extract the cumulants c^(g) 



Psik) = exp 



exp 



(126) 



(4g- 1)!! _ 1 / (2g- 1)!! 
4! 2 V 2! 



E( 



Pushing the calculation to the next orders, we get the sixth and eigth cumulants 



6! 
8! 



(6g-l)!! (4g-l)!!(2g-l)!! 1 
6! 4! 2! ^ 3 



(2g-l)!! 



n 3' 



2! 



(127) 



{8q - 1)!! {6q - 1)!! (2g - 1)!! ^ {4q - 1)!! 



8! 6! 

EK^^f)'- 



2! 



4! 



(2g-l)!! 
2! 



(2g-l)!! 



2! 
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The general cumulant is found by recurrence : 



C2r{q) ^ fy^' [2(r-n)g-l]!! 
(2r)! \^^^ ' (2r-2n)! 



(29-1)!! 
2! 



i-iy 



(29-1)!! 



2! 



J2{wMy. (128) 



Note that the above computation is vahd even when q is real and the interaction term is proportional 
to sign(ui)|tii 1'^. In this case the interaction is still an odd function of u and the derivation goes 
through exactly with the same combinatorics as above. The result is 



C2r{q) 

(2r)! 



r-2 



r 



n)q + 



n=0 



(2r - 2ra)!7rV2 



2!7rV2 



(-1)^ 



r 



Ml 

2!7rV2 



(129) 



41 



10 APPENDIX D: Generalization of the extreme deviation theo- 
rem of Frisch and Sornette [1997] to obtain the tail structure 
of P{6S) in the diagonal case for c > 1 

We start from the definition 5S = J2iLi WiSxi and the corresponding equation for its probabiUty 
density function : 



Pj^{5S) = J ■■■ J e~S-i-^"(^^') 6 (^5S -Y^^WiSx^"^ d6xi ■ ■ ■ dSxN 



(130) 



N 

where we have used the parameterization 

Pi{5xi) = e-^^^'^"^) , (131) 

with 

fiiSxi) = , with c> 1 . (132) 

Xi 

All integrals in (|13C| ) are from — oo to +00. The delta function expresses the constraint on the sum. 
We need the following conditions on the functions fi (see [Frisch and Sornette, 1997] for precisions) : 

(i) fi{6xi) +00 sufficiently fast to ensure the normalization of the pdf's. 

(ii) f['{5xi) > (convexity), where /" is the second derivative of / . 

(iii) liuix^oo x"^ f" (x) = +00. 

Under these assumptions, the leading-order expansion of Pn{6S) for large 5S and finite > 1 is 
obtained by a generalization of the Laplace's method which here amounts to remark that the set 
of 5x*^s that maximize the integrant in ( 130]) are solution of 



fiiSx*) = Cn{5S) , (133) 

where C]\f{dS) is independent of i. In words, the leading behavior of Pn{SS) is obtained by the set 
of (5xi's that occurs with the same probability. The 5x* obey 

N 

J2wiSx* = SS. (134) 
1=1 

Expanding fi{6xi) around 5x* yields 

mxi) = mSx*) + a^hi + bih^ + ... (135) 
where a, = fl{5x*), bi = ^fl'{6x*) and hi = 6xi — 6x* obey the condition 

N 

Y,mh^ = 0. (136) 

1=1 
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We ignore the terms of order higher than two as they do not contribute to the leading order. We 
rewrite 



4:0 j Wf 



2bi ' 46, 



where the Hi = Wihi verify J2i=i = 0- Expression ( 130 ) then becomes 
= e 



. . ^ i=l 4b~ 



The integral in (|138| ) is evaluated by setting y = J2j=i "26^ ^^'^ ~ ^ identity 



e 

N-l 

where A is defined by 



Aj/i|-Ajv(»/-/ti /ijv-i) 



2 iV-1 

dhi ■ ■ ■ d/iAT-i = vr 2 



A 



nf=i A, 



1 ^ 1 

A ^ A,- 

This identity is obtained by viewing y as the sum of the Gaussian variables y^'s. 



For the case (132) with c > 1, the condition (|133| ) together with ( 134 ) yields 



Ax* A"? 

/.(fe:) = (^r = (^r , i.e. 



X* = 6S ^ 
X 



where 



(with c = 2/q) and 



We also have 



N N , 



46, 2(c - 1) ^ X 



1 c(c-l) f6S^^-'^ 



A 2X2 



X 



where 



This yields the contribution 



N 



= exp 



c x\SS^ 



(137) 



(138) 



(139) 



(140) 



(141) 
(142) 

(143) 

(144) 
(145) 

(146) 

(147) 
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Regrouping all terms in (|138|) leads to 



PNiSS) = TT 



N-l 

2 



X UjLiWjXj 



,5S. 



c{c - 1) X 



2-c 



JV-l 
2 



exp 



N ,6S. 



2(c-l)^ X 



where 



X 



X 



(2 - c) (S, i^'.X.)^ -(2-r\ 

^7 = 1 3^3 



(148) 



(149) 



These results are valid for c > 1. The extreme tail of the portfolio wealth distribution is thus 
controlled completely by x^ which is its characteristic decay value. 



2 

Note that -^j^ = 1 for identical assets Xi = X when all weights Wi are equal to 1/A^. In this case, 



(150) 



the exponential term in ( |14S| ) simplifies into 

-xp(-«(f r) . 
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11 APPENDIX E: Computation of the characteristic function de- 
fined by eq.( |151| ) 



The characteristic function of the distribution Ps{5S) of portfoho wealth variations for correlated 
assets with Weibull distributions is given by 

TV 

Psik) 



(27r)^/2 detyi/2 



n ( / du,) e-^ u'v-W+^k . (151) 



We recall the definition of the functional generator [Sornette, 1998] 



5V-'-. (27r)^/2detFV2 



N 



e 2 



When the integral is a Gaussian {k = 0), we get 

P|(0,0) 

With the property 

N 



we can formally express the characteristic function as 



Plik) 



ik . Wi -^Xr 1 , 



i 6 2 



Ji=0 



(152) 



(153) 



/ (5^) / (n e-^^^-^^+i:. =j(ll du,^ e-^^^-^^+i:. -^-'/(^O , (154) 



(155) 



We first consider the case q = 3. The first non- vanishing perturbative contribution for this case is 
obtained by expanding the formal expression above up to second order in k 



Plik) 



53 ^2 p ^3 



J=0 



The first order vanishes because 

53 1 



6J? 



62 



JVJ 



65-^^-^{(yj)f + 3y,,(yj),} 



is zero when J = 0. The second order contribution comes from 



6Jf SJf 



w,,v^^iyJ)] + 9(2^,2 + v,,v^,){yj),{yj), + w,,v,,{yjt + 
6yJ + 9V-iy,,y,-,} . 



(156) 

(157) 
(158) 



"We adopt here the compact notation JV J = . JiVijJj and = Vii J;. 
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By putting to zero the source term J, this expression leads to 



^l(^) = 1 - ^ E (6^* (^nf + 9vJiViiV,,VjjWj) . (159) 



2 



This result can be usefully represented with diagrams in the following way. Let us associate to each 



factor Vij the propagator diagram and to each factor ig^Wj the vertex diagram as shown in Fig. 21 
where we have defined the coupling constant 

53 = 3!fc . (160) 
The two contributions in ( |159| ) can thus be represented by propagators connecting vertices as in 



Fig, 22. Having defined the coupling constant 53 in ( |160D allows us to interpret easily the coefficient 



in front of each diagram corresponding to the two terms in the expression 

Plik) = l-glil-Y.^, {Vi,f W, + ^Y. W^Vi^V^,V,,W, \ . (161) 

The coefficient in front of each term (diagram) is equal to 1/S where S is the symmetry factor of 
each diagram with respect to permutation of lines and vertices. The first diagram has a symmetry 
under exchange of 3 propagators and 2 vertices, therefore we have 5 = 3! x 2!. The second diagram 
has a symmetry under exchange of the 2 lines forming the loop of each vertex, giving a contribution 
of 2! X 2!. It also has the symmetry under permutation of the two vertices. The total symmetry 
factor is therefore 5 = 2! x 2! x 2!. 

A systematic way to keep under control the symmetry factor is to compute it diagramatically as 
follows. The second-order derivative operator with respect to Jj and Jj is represented by the two 



vertices in the left hand side of Fig. 23. 



The J-independent term is given by pairwise combining each leg of each vertex in all the possible 



topologically inequivalent way, taking into account the multiplicity of each configuration. Fig.|23 
shows how to proceed. The coefficient in front of the vertices of the left hand side comes from the 
perturbative expansion. 

As a first step, let us consider the first leg of the first vertex. We can either contract it with another 
leg (two possibile contractions) of the same vertex or with a leg of the second vertex (three possible 
contractions). This is summarized in the first equality of Fig.^. The first contribution of the second 
equality is the result of the contraction with multiplicity 3 of the residual leg of the first vertex of 
the first diagram of the line above. Considering the possible contractions of the second contribution 
of the first equality generate the other two diagrams. At the end of the combining procedure, we 
have two inequivalent diagrams, with multiplicities 3! and 3^. The resulting coefficient in front of 
each diagram can now be interpreted in terms of the symmetries of the diagram as anticipated. 

These rules can be generalized to higher orders and other "interaction" terms, i.e. WiU^ for 
general value of q or even generic functions f{ui) which admit expansions in power series of Uj. As 
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an example, we give the first order correction to the characteristic function for generic value of q 
odd. 



The perturbative expansion is 



+ 



5" 51 

J 



(162) 



j=o 



where we have defined the auxiliary coupling constant gq = qlk. Now, instead of taking explicitely 
the derivatives and keep the term independent on J as the result, let us use the diagrammatic 
formalism. At this order, we have two vertices each with q legs, representing the partial derivatives 
with respect to Jj and Jj and to which we associate the value igqWi and igqWj respectively. Now, 
let us combine pairwise each leg of the two vertices to form all the possible topologically inequiv- 
alent diagrams. We have collected in Fig. 24 the sequence of all the diagrams. Each diagram is 
characterized by the number / of loops on each vertex and the number g — 2Z of lines connecting 
the two vertices giving therefore a contribution 



w 



3 ' 



(163) 



where each loop around vertex i contributes to a factor Vu and each propagator connecting the 
vertices i and j gives a factor Vij. It is now easy to determine the symmetry factor: we have 
I symmetries under the exchange of the two lines of each loop at each vertex ((2!)' x (2!)'), the 
symmetry of the / loops at each vertex ((/!) x (/!)), one symmetry under the exchange of the q — 21 
internal propagators {{2q — /)!), and the reflection symmetry under the exchange of the two vertices 
(2!). The total factor is therefore Sn = {q — 2I)!(2!)^'"''^(/!)^ and the characteristic function up to 
second order in k reads 



Plik) = 1 



('?-l)/2 

z=o 



1 



1 



1 



{q-2l)\{2 



})2l + l (/!^|2 



(164) 



The diagrammatic expansion becomes very useful for calculating the higher cumulants. A well 
known result of diagrammatic perturbation theory tells us that neglecting the disconnected dia- 
grams, which do not occur at second order, corresponds to compute the logarithm of characteristic 
function. Therefore, the set of connected diagrams at m-th order give us directly the m-th cumulant 
coefficient Cm- 

Let us give as an explicit example the computation of the fourth cumulant 04(3). In fig. 25, the 
result of the contraction procedure is shown, where we keep only the connected diagrams. It is 
explicitely 

C4(3) = 4!(3!)^ ^n'»^i2^«i3^i4 |^^j?i2^ni3^i2i4^i3i3'^i4i4 + 



*li«2,«3:«4 

J_T/.2 y. . y. . y. . y. . < J_T/.2 y , y , y2 _|_ 

23 «1*2 *1«3 '^«2«3 '^«3«4 ''1414 ^ 24 *1*2 *1«3 '^«2«4 ^1314 ^ 

2j23^1»2^li3^1M^2i2^3«3^4i4 ~^ ^ ^1^2 ^1»3 ^IM ^2*3 ^2*4 ^3«4 



(165) 



47 



We can thus formally generalize the result for the m-th cumulant as 



1 m m 

c„(g) = m!(g!r E ^n---^.™ E WT^^ 11 (^M.)'^ 11 (^M.r= (166) 

where Gm{Q) is the set of all the topologically inequivalent connected diagrams with m vertices of q 
legs. Each diagram is characterized by the number of loops at each vertex, {Ir}, and the number of 
lines connecting each couple of vertices, {rirs = nsr}- These numbers have to satisfy the constraints 

2lr + ^'^nrs = q , (167) 

which embody the fact that each vertex has q legs. The symmetry factor S is obviously the most 
difficult part to determine. The safest procedure is to compute it with the contraction rule case by 
case. For each diagram, it is of the form 

m m 

5({/r},KJ) = (2!)Sr.i''-n^r! n nrs\S,{{lr},{nrs}) . (168) 

r=l r<s=l 

Each factorial comes from the various symmetries under the exchange of the propagators, and we 
have isolated the contribution of the residual symmetries of the diagram under exchange of the 
vertices. Fig. 26 summarizes the vertex symmetry factors for the diagrams contributing to 04(8). 



48 



REFERENCES : 

R. Arad Wiener, The implications of a long-tailed distribution structure to portfolio selection and 
capital asset pricing, PhD thesis, Princeton University (1975). 

V. Bawa, E.J. Elton and M.J. Gruber, Journal of Finance, vol. XXXIV, 1041-1047 (1979). 

T. BoUerslev, R.Y. Chous and K.F. Kroner, ARCH modeling in Finance - A review of the theory 
and empirical evidence, J. Econometrics 52, 5-59 (1992). 

J. -P. Bouchaud, D. Sornette, C. Walter and J. -P. Aguilar, Taming large events : Optimal portfolio 
theory for strongly fluctuating assets. International Journal of Theoretical and Applied Finance 1, 
25-41 (1998). 

S. Cambanis, S. Huang and G. Simons, On the theory of elliptically contoured distributions. Journal 

of Multivariate Analysis 11, 368-385 (1981). 

P. Embrechts, C. Kliippelberg and T. Mikosh, Modelling extremal events (Springer- Verlag, Appli- 
cations of Mathematics 33, 1997). 

R.F. Englc, Econometrica 50, 987 (1982). 

E. Fama, Management Science 11, 404-419 (1965). 

U. Frisch and D. Sornette, Extreme deviations and applications, J. Phys. I France 7, 1155-1171 
(1997). 

C. C. Geczy, Some generalized tests of mean-variance efficiency and performance, working paper 

B.V. Gncdcnko and Kolmogorov, A.N., Limit distributions for sum of independent random vari- 
ables, Addison Wesley, Reading MA (1954). 

E.J. Gumbel, Statistics of extremes (Columbia University Press, New York, 1960). 
M.J.R. Healy, Matrices for statistics (Clarendon Press, Oxford, 1986). 

J.E. Ingersoll, Jr., Theory of financial decision making (Totowa, N.J. : Rowman k. Littlefield, 1987). 

B. Jorgensen, Exponential dispersion models, J. R. Statist. Soc. B 49 (2), 127-162 (1987). 

Y. Kano, Consistency property of elliptical probability density functions. Journal of Multivariate 
Analysis 51, 139-147 (1994). 

D. Karlen, Using projections and correlations to approximate probability distributions, Computer 
in Physics 12, 380-384 (1998). 

J. Laherrere and D. Sornette, Stretched exponential distributions in Nature and Economy: "Fat 
tails" with characteristic scales, European Physical Journal B 2, 525-539 (1998) 

P. Levy, Theorie de I'addition des variables aleatoires (Gauthier Villars, Paris, 1937-1954). 

R. Litterman and K. Winkelmann, Estimating covariance matrices. Risk Management Series, Gold- 
man Sachs (1998). 

B.B. Mandelbrot, Journal of Business, 36, 394 (1963) 



49 



B. B. Mandelbrot, Fractals and scaling in finance : discontinuity, concentration, risk : selecta volume 
E (New York : Springer, 1997). 

H. Markovitz, Portfolio selection : Efficient diversification of investments (John Wiley and Sons, 
New York, 1959). 

R. C. Merton, Continuous-time finance, (Blackwell, Cambridge, 1990). 

S. Mittnik and S.T. Rachev, Reply to comments on 'Modelling asset returns with alternative stable 
distributions' and some extensions, Econometric Reviews 12, 347-389 (1993). 

J. Owen and R. Rabinovitch, On the class of elliptical distributions and their applications to the 
theory of portfolio choice. The Journal of Finance 38, 745-752 (1983). 

C. R. Rao, Linear statistical inference and its applications, 2d ed. (New York, Wiley, 1973). 
P. A. Samuelson, J. Financial and Quantitative Analysis Vol.2, pp. 107-122 (1967). 

D. Sornette, Large deviations and portfolio optimization, Physica A 256, 251-283 (1998). 

J. von-Neumann and O. Morgcnstern, Theory of games and economic behavior (Princeton, Prince- 
ton University Press, 1944) 

M. Veltman, Diagrammatica, the path to Feynman diagrams (Cambridge University Press, 1995). 



50 



1 1 , , , , , , J 

1000 2000 3000 4000 5000 6000 7000 




Figure 1: Bivariate distribution of the daily annualized returns of the CHF in US $ (i = 1) and of 
JPY in US $ (j = 2) for the time interval from Jan. 1971 to Oct. 1998. One fourth of the data points 
are represented for clarity of the figure. The contour lines define the probability confidence level of 
90% (outer line), 50% and 10%. Also shown are the time series and the marginal distributions in 
the panels at the top and on the side. The parameters for the fit of the marginal pdf's are: CHF 
in US $: Ai = 250, ci = 1.14, roi = 2.13 and JPY in US $: A2 = 350, C2 = 0.8, ro2 = 1.25. 
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Figure 2: Bivariate distribution of the daily annualized returns of the UKP in US $ (i = 1) and of 
JPY in US $ (i = 2) for the time interval from Jan. 1971 to Oct. 1998. One fourth of the data points 
are represented for clarity of the figure. The contour lines define the probability confidence level of 
90% (outer line), 50% and 10%. Also shown are the time series and the marginal distributions in 
the panels at the top and on the side. The parameters for the fit of the marginal pdf's are: UKP 
in US $: Ai = 250, ci = 1.14, roi = 1.67 and JPY in US $: A2 = 350, C2 = 0.8, ro2 = 1.25. 
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Figure 3: Bivariate distribution of the daily annualized returns of the RUR in US $ (z = 1) and 
of JPY in US $ (i = 2) for the time interval from Jun. 1993 to Oct. 1998. One fourth of the data 
points are represented for clarity of the figure. The contour lines define the probability confidence 
level of 50% (outer line) and 10%. Also shown are the time series and the marginal distributions in 
the panels at the top and on the side. The parameters for the fit of the marginal pdf 's are: RUR 
in US $: Ai = 80, ci = 0.38, roi = 0.83 and JPY in US $ ^2 = 120, C2 = 0.8, ro2 = 1.25. 
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Figure 4: Bivariate distribution of the daily annualized returns of the CAD in US $ {i = 1) and of 
JPY in US $ (i = 2) for the time interval from Jan. 1971 to Oct. 1998. One fourth of the data points 
are represented for clarity of the figure. The contour lines define the probability confidence level of 
90% (outer line), 50% and 10%. Also shown are the time series and the marginal distributions in 
the panels at the top and on the side. The parameters for the fit of the marginal pdf 's are: CAD 
in US $: Ai = 250, a = 0.98, m = 0.59 and JPY in US $: A2 = 350, C2 = 0.8, ro2 = 1.25. 
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Figure 5: Bivariate distribution of the daily annualized returns of the MYR in US $ (z = 1) and of 
JPY in US $ (i = 2) for the time interval from Jan. 1971 to Oct. 1998. One fourth of the data points 
are represented for clarity of the figure. The contour lines define the probability confidence level of 
90% (outer line), 50% and 10%. Also shown are the time series and the marginal distributions in 
the panels at the top and on the side. The parameters for the fit of the marginal pdf 's are: MYR 
in US $: Ai = 150, a = 0.56, roi = 1.00 and JPY in US $: A2 = 350, C2 = 0.8, ro2 = 1.25. 
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Figure 6: Close-up of the countour lines of the bivariate distributions. One can observe that 
the countour lines at the center are not far from elliptical while they depart more and more from 
ellipses for larger levels. The corresponding probability levels are: CHF: 0.44, 0.24, 0.13, 0.07, 0.04 
UKP: 0.48, 0.30, 0.17, 0.09, 0.07 RUR: 0.66, 0.56, 0.43, 0.36, 0.34 CAD: 0.60, 0.37, 0.22, 0.12, 0.10 
MYR: 0.42, 0.32, 0.25, 0.20, 0.17 



56 




-4-2 2 




Figure 7: Bivariate distribution -P(y) obtained from Fig.JPYCHF(r) using the transformation 
Eq.(p^). The contour hues are defined as in Fig.JPYCHF(r). The upper and right diagrams show 
the corresponding projected marginal distributions, which are Gaussian by construction of the 
change of variable Eq.(|l6|). The solid lines are fits of the form j4exp(— |yp/2) with Ai = 200,^2 = 
220. 
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Figure 8: Bivariate distribution P{y) obtained from Fig.y using the transformation Eq.(|l6D. 
The contour hues are defined as in Fig.|. The upper and right diagrams show the corresponding 
projected marginal distributions, which are Gaussian by construction of the change of variable 
Eq.([ll). The solid lines are fits of the form ^ exp(-|?/| V2) with Ai = 250, A2 = 220. 
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Figure 9: Bivariate distribution P{y) obtained from Fig.^ using the transformation Eq.(|l6D. 
The contour hues are defined as in Fig.|. The upper and right diagrams show the corresponding 
projected marginal distributions, which are Gaussian by construction of the change of variable 
Eq.([ll). The solid lines are fits of the form ^4 exp(-|?/p/2) with Ai = 50,^2 = 50. 
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Figure 10: Bivariate distribution -P(y) obtained from FigJ^ using the transformation Eq.(|T6|). 
The contour hues are defined as in Fig.|. The upper and right diagrams show the corresponding 
projected marginal distributions, which are Gaussian by construction of the change of variable 
Eq.([ll). The solid lines are fits of the form ^ exp(-|?/| V2) with Ai = 380, A2 = 220. 
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Figure 11: Bivariate distribution -P(y) obtained from Figjs] using the transformation Eq.(|T6|). 
The contour hues are defined as in Fig.|5[ The upper and right diagrams show the corresponding 
projected marginal distributions, which are Gaussian by construction of the change of variable 
Eq.([ll). The solid lines are fits of the form ^ exp(-|?/| V2) with Ai = 380, A2 = 220. 
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Figure 13: a-e: cumulative distribution for N = 2 degrees of freedom versus the fraction of 
events shown in Figures @,|,0J|, outside an ellipse of equation = y'V-^)y. a) CHF-JPY, b) 
UKP-JPY, c) RUR-JPY, d) CAD-JPY, e) MYR-JPY. f) same plot as a)-e) but for = 6 degrees 
of freedom for the data set CHF-UKP-RUR-CAD-MYR-JPY. 
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Figure 14: Same as Fig. (|T^) but for the returns r. a-e: cumulative distribution for N = 2 
degrees of freedom versus the fraction of events shown in Figures |l|j2|,Q,^ |5| outside an ehipse of 
equation = r'V-^)r. a) CHF-JPY, b) UKP-JPY, c) RUR-JPY, d) CAD-JPY, e) MYR-JPY. f) 
same plot as a)-e) but for = 6 degrees of freedom for the data set CHF-UKP-RUR-CAD-MYR- 
JPY. 
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Figure 15: Variance (thick solid line), excess kurtosis k (thin sohd hne) and sixth-order normalized 
cumulant Ag as a function of the weight p of currency 1 (the weight of currency 2 is 1 — p) . for the 
data sets: a) CHF-JPY, b) UKP-JPY, c) RUR-JPY, d) CAD-JPY and e) MYR-JPY. k is divided 
by 2 and Ae is divided by 300. 
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Figure 16: Comparison of the empirical excess kurtosis (fat solid line) shown in Fig. for the 
five portfolios to our theoretical prediction ( |40| ) with (??) for uncorrelated assets (solid line). The 
exponents ci and C2 are those determined in the fits of the pdf's tail, as given in Fig.|l2[ The 
thin solid lines and dashed lines plot the theoretical formula (^) for values of the exponents 
= Ci/2 ± 0.05. Figure c (RUR) is the same as a-b and d-e but the thin solid line gives the 
predicted excess kurtosis for exponents ci + 0.05, C2 ± 0.05. Figure f compares the empirical excess 
kurtosis (fat solid line) of the portfolio CHF-JPY (Figure a) to the prediction (|65|) with (^) for 
correlated assets with the fixed exponents ci = C2 = 2/3. The thin solid line correspond to the 
empirical value p{yl,y2) = 0.57 while the dashed line is obtained from the same formula with 
p{yl,y2)=0. 
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Figure 17: Plots of the pdf's Priy) as a function of z = so that a Gaussian (in the y variable) 
is qualified as a straight line (dashed line). In turn, by the construction explained in the text, a 
straight line qualifies a Wcibull distribution. Here is shown the case T = 2 for which the best ct 
is C2 = 0.73. The other curves allow one to estimate the sensitivity of the representation of Pt in 
terms of a WeibuU as a function of the choice of the exponent ct- The curves have been normalized 
by a coefficient Aj,, with A2 = 10 for C2 = 0.66 and A2 = 8 for C2 = 0.73 and 0.8. 
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Figure 18: Plots of the pdf's Priy) as a function of z = y"^ so that a Gaussian (in the y variable) 
is qualified as a straight line (dashed line). In turn, by the construction explained in the text, a 
straight line qualifies a Wcibull distribution. Here is shown the case T = 4 for which the best ct 
is C4 = 0.80. The other curves allow one to estimate the sensitivity of the representation of Pt in 
terms of a WeibuU as a function of the choice of the exponent ct- The curves have been normalized 
by a coefficient Aj,, with A4 = 25 for all C4S. 
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Figure 19: Plots of the pdf's Priy) as a function of z = y^ so that a Gaussian (in the y variable) 
is qualified as a straight line (dashed line). In turn, by the construction explained in the text, a 
straight line qualifies a Weibull distribution. We show here the case T = 8 for which the best ct 
is cs = 0.90. The other curves allow one to estimate the sensitivity of the representation of Pt in 
terms of a Weibull as a function of the choice of the exponent ct- The curves have been normalized 
by a coefficient A^, with As = 40 for cs = 0.8 and 1.0 and As = 360 for cs = 0.9. 
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Figure 20: Plots of the pdf's Priy) as a function oi z = so that a Gaussian (in the y variable) 
is qualified as a straight line (dashed line). In turn, by the construction explained in the text, a 
straight line qualifies a WeibuU distribution. We show here the case T = 20 for which the best ct 
is C20 ~ 1-05. The other curves allow one to estimate the sensitivity of the representation of Pt in 
terms of a Weibull as a function of the choice of the exponent ct- The curves have been normalized 
by a coefficient A^, with ^20 = 4.7 10^ for C20 = 1-0, A20 = 3.5 10^ for C20 = 1-05 and A20 = 7 10^ 
for C20 = 1-25. 
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Figure 21: We associate to each factor Vij the propagator diagram and to each factor ig^Wj the 
vertex diagram as shown in this figure, where we have defined the coupling constant = 2>\k. 
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Figure 22: The two contributions in ( |159| ) are represented by propagators connecting the vertices 
as shown in the figure. 
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Figure 23: A systematic way to keep under control the symmetry factor is to compute it diagra- 
matically as shown in this figure. The second-order derivative operator with respect to Jj and Jj is 
represented by the two vertices in the left hand side of the figure. The J-independent term is given 
by pairwise combining each leg of each vertex iS all the possible topologically inequivalent way, 
taking into account the multiplicity of each configuration. The figure shows how to proceed. The 
coefficient in front of the vertices of the left hand side comes from the perturbative expansion. As 
a first sten. let us consider the first leer of the first vertex. We can either contract it with another 




Figure 24: Sequence of all diagrams obtained by combining pairwise each leg of two vertices to 
form all the possible topologically inequivalent diagrams. Each diagram is characterized by the 
number I of loops on each vertex and the number (q — 2)1 of lines connecting the two vertices 
giving the contribution (163) where each loop aiiAmd vertex i contributes to a factor Va and each 
propagator connecting the vertices i and j gives a factor Vij . 
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Figure 26: Summary of the vertex symmetry factors for the diagrams contributing to 04(8). 
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